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(57) Abstract 

The invention relates to new expression systems and in particular to an expression system in which a gene of interest is expressed 
at an optimal level. The invention provides a recombinant expression vector comprising a gene of interest and a selectable marker gene, 
wherein the selectable marker gene is arranged downstream of the gene of interest and a stop codon associated with the gene of interest is 
spaced from a start codon of said selectable marker gene at a distance which is sufficient to ensure that translation reinitiation is required 
before said selectable marker protein is expressed from the corresponding MRNA. Examples of such expression systems are vector viral 
packaging cell lines and a number of preferred cell lines have been identified. 
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Expression systems 

The present invention relates to new expressions systems, 
5 and in particular to expression systems in which a gene of 

interest is expressed at an optimal level. Particular 
examples of such expression systems are retroviral packaging 
cell lines and a number of preferred cell lines have been 
identified. 

10 

The ability of eukaryotic and prokaryotic ribosomes to 
reinitiate translation at an internal start codon within an 
mRNA sequence has previously been recognised. Studies have 
been reported in which the efficiency of the process, which 

15 is generally regarded as being low, has been connected with 

the length of the intercistronic sequence (Kozak (1987) 
Mol. Cell Biol. 7, 3438-3445). Selection of this sequence 
or spacer as 70bp in length, and containing no other start 
codons, has been previously reported as being optimal for 

20 reinitiation in a eukaryotic cell line (Cosset F-L., 

Virology (1991) 185, 862) . 

The applicants have found a way in which the inefficiency 
associated with the translation reinitiation process can be 
25 used to good effect. 

According to the present invention there is provided a 
recombinant expression vector comprising a gene of interest 
and a selectable marker gene, wherein the selectable marker 

3 0 gene is arranged downstream of the gene of interest and a 

stop codon associated with the gene of interest is spaced 
from a start codon of said selectable marker gene at a 
distance which is sufficient to ensure that translation re- 
initiation is required before said selectable marker protein 

35 is expressed from the corresponding mRNA. 
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The invention further provides a process for producing cell 
lines in which a gene of interest is expressed, which 
process comprises transforming host cells with an expression 
vector comprising said gene of interest and a selectable 
marker gene, wherein the selectable marker gene is arranged 
downstream of the gene of interest and a stop codon 
associated with the gene of interest is spaced from a start 
codon of said selectable marker gene at a distance which is 
sufficient to ensure that translation re-initiation is 
required before said selectable marker protein is expressed 
from the corresponding mRNA, and selecting those cells where 
expression of the selectable marker gene may be detected. 

Since re- initiation of translation is a relatively 
inefficient process, this means that the selectable marker 
protein will be expressed at lower levels than the product 
of the gene of interest. When the marker protein is 
expressed at detectable levels, the gene of interest will be 
expressed at higher levels. This will ensure that during 
the subsequent selection procedure, only those cell clones 
which express the gene of interest at higher or optimal 
levels will survive. Low expressing clones will be 
eliminated by the selection process. 

Cells transformed with the above -described expression 
vectors form a further aspect of the invention. 

The host cells are suitably eukaryotic or prokaryotic host 
cells, preferably eukaryotic host cells. 

The number of nucleotides in the space between the stop 
codon of the gene of interest and the start codon of the 
selectable marker will suitably be in the range of from 20- 
200 nucleotides, preferably from 60-80 nucleotides, even 
more preferably 70-80 nucleotides. 
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The vectors used in the process of the invention may be any 
of the known types, for example expression plasmids or viral 
vectors . 

5 Selected cells may be cultured and if required, the protein 

product of the gene of interest isolated from the culture 
using conventional techniques. Alternatively, expression of 
the gene of interest may result in other desired effects, 
for example, where the gene of interest is included as part 
10 of a viral packaging construct. 

Some experimental and clinical gene transfer protocols 
require the design of gene transfer vectors suitable for in 
vivo gene delivery (Miller, A.D. 1992. Nature 357: 455- 

15 460) . Retroviral vectors are attractive candidates for 

such applications, because they can provide stable gene 
transfer and expression (Samarut J. et al.,Meth. Enzymol. in 
press) and because packaging cells have been designed which 
produce non- replication competent viruses (Miller A.D (1990) 

20 Hum Gene Ther. 1 5-14) . However currently available 

recombinant retroviruses suffer from a number of drawbacks. 

Packaging cell lines provide in trans the retroviral 
proteins encoded by the gag, pol , and env genes required to 
25 obtain infectious retroviral particles. The gag and pol 

products are respectively the structural components of the 
virion cores and the replication machinery (enzymes) of the 
retroviral particles whereas the env products are envelope 
proteins responsible for the host -range of the virions and 
for the initiation of infection and for sensitivity to 
humoral factors. An ideal packaging cell line should 
produce retroviruses that only contain the retroviral vector 
genome, and absolutely no replication-competent genomes or 
defective genomes encoding some of the viral structural 
35 genes. 



30 
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A number of packaging cell lines designed for human gene 
transfer have been designed in the past by introducing 
plasmid DNAs which contain "helper genomes" encoding gag, 
pol and/or env genes into cells. 

Retroviral packaging cell lines are cells that have been 
engineered to provide in trans all the functions required to 
express infectious retroviral vectors. A helper genome (or 
construct or unit), is herein also referred to as 
-retroviral packaging construct (or unit)." or "packaging- 
deficient construct (or genome unit)" or "gag-pol/env 
expression plasmids" . 

Much efforts has been made to design strategies to optimize 
the helper- genomes in order (i) to get the highest 
production of retroviral packaging functions (which 
correlates which infection titers of retroviral particles) 
and (ii) to minimise the chance that the helper genome can 
be transmitted via the viral particles (which may lead to 
emergence of unwanted retroviral forms) . 

The first of these packaging cell lines used full length 
retroviral genomes as helper genomes that had been crippled 
for important cis -regulated replicative functions (reviewed 
in Miller, Hum. Gene. Ther. 1:5-14 1990). In order to reduce 
the possibility of occurrence of replication-competent 
viruses and of transfer of virus structural genes, a second 
generation of safer packaging cell lines has been designed 
by using two separate and complementary helper genomes which 
express either gag-pol or env and are packaging-deficient 
(Miller supra) . 

The cells into which these helper genomes were introduced 
were isolated by cotransf ecting them with plasmids encoding 
selectable markers. However, as no selection was applied on 
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the packaging-deficient retroviral genome itself, the" helper 
functions can be lost during the passages of the cells in 
culture and the current packaging systems provide limited 
titers of infectious retroviral vectors, usually only of the 
order of 10 5 -10 6 infectious units' i.u/ml. Indeed the 
cotransfection with a plasmid encoding a selectable marker 
does not directly select the best gag-pol-env-expressing 
cells. 
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10 The invention further provides a retroviral packaging cell 

line comprising a host cell transformed with (i) a packaging 
deficient construct which expresses a viral gag-pol gene and 
a first selectable marker gene, and/or (ii) a packaging- 
deficient construct which expresses a viral env gene and a 
second selectable marker gene; wherein a start codon of the 
first and second selectable markers are spaced from the stop 
codons of the viral gag-pol gene and the viral env gene 
respectively by a distance which ensures that reinitiation 
of mRNA translation is required for expression of marker 
protein product of said first and/or second selectable 
marker gene. 



20 



The retroviral packaging cell line may be obtained by the 
above described process which will involve selecting 
25 transfected cells which express said first and/or second 

marker genes. 

By using helper constructs which are directly selectable and 
which provide for high expression of the viral gene, high 
titre retroviral vectors may be obtained. 



30 



Helper constructs for use in the process form a furtner 
aspect of the invention. 

35 The retroviral vectors prepared from the conventional 



WO 97/08330 



PCT/GB96/02061 



e 

packaging cell lines are usually not contaminated by " 
replication-competent retroviruses (RCRs) . However, 
recombinant amphotropic murine retroviruses have been shown 
to arise spontaneously from certain packaging cell lines. 
The generation of such RCRs involves recombination at least 
between gag-pol/env packaging sequence and vector sequences 
(Cosset et al., Virology, (1993) 193:385-395). 

Recombinant RCRs have been associated with the development 
of lymphomas in some severely immunosuppressed monkeys 
(Donahue et al., J. Exp Med (1992) 176: 1125-1135). In 
addition, retroviral vector preparations may also contain, 
at low frequencies, retroviruses coding for functional 
envelope glycoproteins (Kozak and Kabat, 1990, J. Virol. 64: 
3500-3508) or for gag-pol proteins. Although the 
pathogenicity of these gag-pol or env recombinant 
retroviruses is probably low, more evolved recombinant 
retroviruses with higher pathogenic potential may occur when 
injected in vivo, by recombination and/or complementation of 
the initial recombinant viruses with some endogenous 
retroviruses . 

In a preferred embodiment of the retroviral packaging cell 
lines of the invention, the overlapping sequences between 
the genomes of the retroviral vector and the helper 
construct are reduced, for -example as compared to constructs 
such as CRIPenv and CRIPAMgag (Danos et al., Proc. Natl. 
Acad. Sci USA 85: 6460-6464). In particular, the viral 
sequences in the helper construct are reduced, for example, 
not only the packaging sequence but also the 3' Long 
Terminal Repeat (LTR), the 3' non-coding sequence and/or the 
5 ' LTR may be eliminated. 

The' possibility of generation of such RCRs and recombinant 
retroviruses can be reduced by reducing the overlapping 
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sequences between the genomes of both the retroviral vector 
and the helper construct. 

Conventional retroviral vectors are strongly inactivated by 
human serum which makes them of limited or no use for in 
situ gene transfer in gene therapy applications. It has 
previously been shown that inactivation by complement in 
human serum is controlled by the cell line used to produce 
the virions and by viral envelope determinants (Takeuchi et 
al., J. Virol (1994) 68:8001-8007). In particular, 
inactivation is caused by some properties of the cell lines 
that have been used to construct the packaging cells (NIH- 
3T3) and also by viral determinants located in the 
retroviral envelope as shown (Takeuchi et al., J. Virol 
(1994) 68:8001-8007). In vivo gene delivery is an important 
goal for a number of human gene therapy strategies. 

The applicants have found that certain cell lines form 
preferred packaging cell lines. 

Particularly preferred packaging cell lines are the HT1080 
line, the TE671 line, the 3T3 line, the 293 line and the Mv- 
1-Lu line. One example of retroviral packaging cells that 
will produce complement-resistant virus comprise human 
HT1080 cells and express RD114 envelope. Such cells form a 
preferred aspect of the invention. 

Packaging cell lines according to the invention provide 50- 
100 fold increased titers of retroviral vectors as compared 
to conventional packaging cell lines. Retroviral vectors 
provided by these new cells are safe, in terms of generation 
of RCRs, and considerably more resistant to inactivation by 
human complement. 

Packaging cell lines according to the invention may be able 
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to transduce helper-free, human complement-resistant " 
retroviral vectors at titers consistently higher than 10 7 
i . u . /ml . 



Suitable semi -packaging cell lines in accordance with the 
invention are those which express only the gag-pol genes. 
Such cell lines may suitably be derived from TE671, MINK Mv- 
1-Lu, HT1080, 293 or NIH-3T3 cells by introduction of 
plasmid CeB (the MoMLV gag-pol expression unit) . 

Particularly preferred expression vectors in accordance with 
the invention for use' in retroviral packaging cell lines are 
those which include MLV gag and pol genes such as CeB. 
Other plasmids may include gag and pol genes from other 
retroviruses or chimeric or mutated gag and pol genes. 

Various viral and retroviral envelope genes may be included 
in the plasmids such as MLV-A envelope, GALV envelope, VSV-G 
protein, BaEV envelope, RD114 envelope and chimeric or 
mutated envelopes. Plasmids which include the RD114 env 
gene such as FBde 1 PRDSAF as illustrated hereinafter, provide 
one example of suitable constructs. 

The novel retroviral packaging cells described hereinafter, 
have been designated FLY cells, and may be designed for in 
vivo gene delivery. 

Considerable variations were found between the various cell 
lines screened for their ability to release type C mammalian 
retroviruses. In addition, few cell lines were able to 
produce retroviruses completely resistant to human 
complement. Based on these two criteria, human fibrosarcoma 
HT1080 and rhabdomyosarcoma TE671 cells were selected for 
optimum construction of packaging cells. 
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Other studies have shown the importance of endogenous" 
retrovirus expression in the generation of recombinant 
retroviruses from retroviral packaging lines (Ronfort et 
al., Virology, (1995), 207, 271-275, Vanin, E.F.et al., J 
5 Virol (1994) 68:4241-4250.) . The co-packaging of an 

endogenous genome and a vector can lead to emergence of 
recombinant retroviruses (Vanin et al . , supra) . 
Recombination involves template switching during reverse 
transcription of such hybrid retroviruses (Hu et al . , 
10 Science, (1990) 250:1227) and homologies between the ' two 

genomes considerably enhance the frequency of reverse 
transcriptase jumps (Zhang et al., J. Virol. (1994) 68: 
2409-2414) . Therefore an ideal packaging cell line should 
not express endogenous MLV-like (or type C retrovirus -like) 
retroviral genomes which can be packaged by type C gag 
proteins (Scadden et al., J. Virol. (1990) 64: 424-427, 
Torrent et al., J. Mol . Biol. (1994) 240 434-444). 



15 



20 



Packaging of human endogenous retroviral RNA was not 
detected in TELCeB and FLY packaging cells when virion 
associated RNA was analysed by RT-PCR using generic primers. 

HT1080- and TE671 derived packaging cell lines may be safer 
in this respect than those generated from NIH3T3 cells, such 
as GP+EAM12 cells, which are known to express and package 
25 sequences related to type C retroviruses (Scadden et al . 

supra) . 

To generate the FLY packaging cell lines, HT1080 cells were 
transfected with gag-pol and env expression plasmids 

30 designed to optimise viral protein expression. Direct 
selection for viral gene expression was achieved in 
accordance with the invention by expression of a selectable 
marker gene by re- initiation of translation of the mRNA 
expressing the viral proteins. This strategy resulted in 

35 packaging cell lines capable of producing extremely high 
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titer viruses. Furthermore, long-term expression of " 
packaging functions can be maintained in these cells. Many 
unnecessary viral sequences were eliminated from the 
packaging constructs to reduce the risk of helper virus 
generation; indeed the final packaging cells did not produce 
helper virus, in that no replication competent virus (RCR) 
could be detected per 10 7 vector particles. 

The FLY packaging cells described herein are safer than, for 
example, psiCRIP cells, at least for generation of env 
recombinant retroviruses as is illustrated in Table 4 
hereinafter, probably because less retroviral sequences 
overlapping with the vector were present in the present env- 
expression plasmid. Few reports have addressed the question 
of the characterization of recombinant retroviruses (RVs) 
(Cosset, F.L., et al., Virology (1993) 193:385-395). It is 
possible that such RVs could not be detected in previous 
packaging cell lines due to lower overall titers. RVs are 
defective in normal cell culture conditions but are likely 
to evolve to replication competent viruses if they are 
allowed to replicate in cells complementing their expression 
like co-cultivated packaging cells (Bestwick et al., Proc. 
Natl Acad Sci USA, (1988) 85: 5404-5408, Cosset et al., 
(1993) supra) . 

In preferred retroviral packaging systems according to the 
invention, RVs are eradicated for example by removal of 
viral LTRs from the packaging construct. 

Consistent with our previous studies (Takeuchi, Y. , et al., 
J Virol (1994) 68:8001-8007), Lac2(RD114) and lacZ (MLV-A) 
pseudotypes produced from HT1080 and TE67lcells were more 
resistant to human complement than LacZ(RDll4) or LacZ (MLV- 
A) pseudotypes produced by 3T3 of dog cells. It was 
therefore decided to use RD114 and MLV-A env genes to 
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11 

generate recombinant virions with MoMLV cores. 

The sequence of RD114 env gene was determined and is shown 
in Figure 4. It was found to be very close to BaEV (baboon 
endogenous virus) a type C retrovirus (Benveniste, R.E.et 
al., Proc. Natl. Acad. Sci. USA (1973) 70:3316-3320; Kato, 
S.et al. # Japan. J. Genet. (1987) 62:127-137) with an 
envelope gene displaying similarities to the external part 
of type D simian retroviruses (SRVs) . RD114 uses the SRV 
receptor on human cells (Sommerfelt & Weiss, Virology 
(1990) 176:58-69; Sommerfelt, M.A. et al., J Virol (1990) 
64:6214-6220) making the FLY packaging cells with RD114 
envelope capable of generating virions with different 
tropism. Retroviral vectors prepared so far for human gene 
15 therapy have used either MLV-A or GALV (gibbon ape leukemia 

virus) envelopes which display some similarities (Battini, 
J.L.,et al., J Virol. (1992) 66:1468-1475) and which use two 
related cell surface receptors for infection (Miller, D.G. 
et al., J Virol (1994) 68:8270-8276). Differences in tissue- 
specific expression of MLV-A or GALV receptors have been 
reported (Kavanaugh et al., Proc Natl Acad Sci USA (1994) 
91:7071-7075) . 



20 



The invention will now be particularly described by way of 
25 example with reference to the accompanying drawings in 

which: 

Figure 1 . illustrates the structure and expression of CeB. 
The env gene (Xbal-Clal) of plasmid pCRIP was removed and 
30 was replaced by coinsertion of the two fragments Xbal-Sfil 

(restriction sites underlined) from pOXEnv and a Sfil-Clal 
PCR product containing the bsr selectable marker. This " 
results in positioning the bsr start codon (shadowed) 74 bp 
downstream to the pol stop codon (bold) . 

35 



WO 97/08330 



PCT/GB96/02061 



12 



Open triangle are start codons (gas and bsr) , black " 
triangles are stop codons (pol and bsr) . The shadowed 
triangle is the start codon of env, in the same reading 
frame with that of bsr. SD and SA are the splice donnor and 
splice acceptor sites. 

Figure 2 illustrates the structure and expression of 
FbdelPASAF. 

Immediately after the stop codon of ejw (bold) was inserted 
a non retroviral Kasl-Ncol (restriction sites underlined) 
linker which positions the phleo start codon (shadowed) 76 
bp downstream. 

Open triangle are start codons (env and phleo ) , black 
triangles are stop codons (env and phleo ) . SD and SA.are 
the splice donnor and splice acceptor sites. 

Figure 3 illustrates plasmids for expression of Ampho, Eco, 
RD114, Xeno, 10A1, GALV, VSV-G and FeLVB envelopes. 
All genes are expressed in the same backbone as detailed in 
fig- 2. The Bglll sites for ecotropic (MoMLV strain), 
10A1, xenotropic (NZB.1.V6 strain) and amphotropic (4070A 
strain) , the Ndel site of RD114 (SC3C strain, the BamHl site 
for both FeLVB and GALV were used as 5' ends, and linked to 
Mscl site immediately after the splice donor site in the 
leader of FB29 LTR. 

Figure 4 shows the sequence of the RD114 env gene (SEQ ID No 
1) 

Figure 5 shows the genetic structure of gag-pol constructs. 
Initiation 0 and termination (T) codons are shown. The 
thick dotted line below each construct shows MLV-derived 
sequences. Nucleotide positions of MLV-derived sequences are 
shown according to: Shinnick et al. (1981) (from nt 1 to nt 
6000 with deletion of the packaging signal (DY) from Ball 
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(nt 215) to PstI (nt 568), and with some further MoMLV 
sequences in both CeB and CeB DS- from nt 7676 to nt 7938. 
gag-pol and bsr genes were expressed from the same 
transcription unit using the either a retroviral promoter 
(Mo LTR) or a non retroviral promoter (hCMV) and non 
retroviral polyadenylation sequence (polyA) . Splice donor 
(SD) and acceptor (SA) sites are indicated. The thin line 
denotes retroviral non coding sequences. The thick line 
shows the rabbit beta-l globin intron B. The position of 
some restriction sites is indicated. 

The nucleic acid sequences of portions of constructs (as 
shown in Figure 5 (boxed areas) ) are displayed for CeB (SEQ 
ID No 2, Figure 6), hCMV+intron (SEQ ID No 3, Figure 7) and 
hCMV+intronkaSD (SEQ ID No 4, Figure 8) . 



The nucleic acid sequences of portions of constructs (as 
shown in Figure 3 (boxed areas) ) are displayed for 
FbdelPASAF (SEQ ID No 5, Figure 9), FbdelPMOSAF (SEQ ID No 
6, Figure 10), FbdelPGASAF (SEQ ID No 7, Figure 11), 
FbdelPRDSAF. (SEQ ID No8, Figure 12) and CMV10A1 (SEQ ID No 
9, Figure 13) are shown. 

The components of the viral particles are produced by two 
independent expression plasmids (gag-pol or env) which also 
contain selectable markers (bsr or phleo) expressed from the 
same transcriptional units as gag-pol or env (figs. 1& 2) . 
The selectable markers are located downstream to gag-pol or 
env genes and there is an optimal distance between the stop 
codon of the upstream reading frames and the start codon of 
the selectable genes that should allow re-initiation of 
translation (Kozak, Mol Cell Biol. (1987) 7, :3438-3445) . 
Because there is no "Kozak" sequence (Kozak, Cell, (1986) 44: 
283-292) required for a normal initiation of translation for 
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the marker gene, they can only be expressed by re-initiation 
of translation after the upstream viral gene has been 
successfully expressed. Consequently and also because re- 
initiation of translation is a poorly efficient process, 
after transfection of these plasmids, cells resistant to the 
drugs corresponding to those selectable genes express high 
levels of the viral proteins. 

To avoid viral transmission of these "helper" genomes the 
constructs used suitably have the classical deletions of 
both the packaging, sequence located in the leader region and 
of the 3'LTR, the latter being replaced by SV40 
polyadenylation sequences (Figs 1 & 2) . 

Plasmid CeB is the MoMLV gag-pol-expression .unit . It 
derives from pCRIP, a plasmid used to generate the 
constructs introduced in the CRIP and CRE packaging cell 
lines (Danos and Mulligan, 1988). As shown in fig. l for 
generation of plasmid CeB the env gene of pCRIP has been 
deleted mostly and the bsr selectable marker, -encoding a 
protein conferring resistance to blasticidin (Izumi et al., 
Experimental Cell Research (1991) 197, 229-233)- has been 
inserted downstream to pol gene. There are exactly 74 bp 
with no ATG triplets between the stop codon of p^l and the 
start codon of bsr, this allows its expression by re- 
initiation of translation on the gag-pol mRNA, after- 
translation of the gag -pol reading frame. 

FbdelPASAF is a plasmid expressing the amphotropic env gene 
and the phleo selectable marker conferring resistance to 
phleomycin (Gatignol et al . , FEBS Letters (1988) 230:171- 
175) . By using a PCR-mediated mutagenesis strategy which 
modifies the end of env gene (see fig. 2), a 76 bp linker 
was inserted between the stop codon of env and the start 
codon of phleo . This allows expression of phleo from the 
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env mRNA by re-initiation of translation. In addition 
compared to known env- expressing constructs, this strategy 
of construction has reduced the length of sequences 
overlapping with the ends of conventional retroviral 
vectors. The env genes of Mo-MLV, FeLVB, NZB.1V6, 10A1, 
GALV and RD114 are expressed by plasmids FBdelPMoSAF, 
FBdelPBSAF, FBdelXSAF, FBdelpGSAF, FBdelpl 0 A1SALF and 
FBdelPRDSAF , respectively, by using the same backbone as 
FBdelPASAF (fig. 3). Retroviral vectors produced with the 
RD114 envelope, will be useful for in vivo gene delivery as 
comparatively to MLV ecotropic or amphotropic envelopes, 
virions pseudotyped with RD114 envelopes are not inactivated 
by human complement when they are produced by Mink Mv-l-Lu 
cells or by some human cells (Table 1) . 



The HT1080 cell line, isolated from a human fibrosarcoma 
(ATCC CCL121) . The TE671 cell line isolated from a human 
rhabdomyosarcoma (ATCC CRL 8805) (purchased from ATCC, and 
tested for absence of usual cell culture contaminants by 
ECACC) , has been used for the definitive construction of 
packaging cell lines. HT1080 line was chosen among a panel 
of primate and human lines because MLV-A and RD114 
efficiently rescued retroviral vectors from these cells and 
also because RD114 pseudotypes produced by this cell line 
25 were stable when incubated in human serum. In a standard 

assay (Takeuchi et al . , J Virol (1994), 68, 8001-8007), 
these latter viruses were found more than 500 fold more 
stable than similar pseudotypes produced in 3T3 cells. 

30 Another advantage for the use of non murine cells to derive 
packaging lines is the absence of MLV-related endogenous 
retroviral -like sequences (like VL30 in 3T3 cells) that can 
cross-package with MLV-derived retroviral vectors (Torrent 
et al., 1994) and generate potentially harmful recombinant 

35 retroviruses. 
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The helper constructs were introduced into other cell" lines 
(HT1080 (table 2) Mink Mv-l-Lu (table 2), 3T3 (not shown), 
TE671 (table 2)) for the purpose of comparisons of the 
efficiency of the constructs. 



As illustrated hereinafter (Table 2), the reverse 
transcriptase (RT) activity (provided by expression of the 
pol gene) in cells transfected with CeB is significantly 
higher than that of the same cells transfected by the 
parental plasmid pCRIP or that of cells chronically infected 
by MLV. This enhancement of viral gene expression is 
correlated with the titers of lacZ retroviral vectors when 
an envelope is provided in CeB-lacZ cells after comparison 
with titers of lacZ pseudotypes of either replication- 
competent viruses or other helper- free packaging systems. 

For the generation of final packaging cell lines, the best 
clonal env trans.f ectants have been selected. Packaging 
systems obtained in this way will be able to produce helper- 
free retroviral vectors at titers greater than 10" 
infectious particles per ml, which would be 10-100 fold 
higher to helper-free preparations of others. 

Because of the way the selectable markers are expressed (see 
above) , growing the packaging cells in phleomycin and 
blasticidin selective pressure increase and stabilize the 
expression of the retroviral components and particularly the 
envelopes, as it is possible that env glycoproteins have 
toxic effects for the producer cells in the long term which 
may lead to a decrease of expression. 

Such an enhancement of viral production observed with the 
packaging systems described herein might increase the 
emergence of unwanted retroviruses having recombined between 
the genomes of both the retroviral vector and either of the 
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two packaging-deficient constructs. However, the constructs 
have been designed in such a way that it reduces the 
probability of emergence of recombinant viruses compared to 
the parental constructs. To check their safety, attempts 
5 have been made to detect the presence of replication- 

competent retroviruses by a mobilisation assay of a lacZ 
provirus. No RC viruses have been found in all retroviral 
vector preparations tested so far. 

10 The following Examples illustrate the invention. 

Example 1 

Preparation of Cell lines and viruses. 
15 The following cell lines were used: 

A204 (ATCC HTB 82), HeLa (ATCC CCL2) , HT1080 (ATCC CCL121) , 
MRC5 (ATCC CCL171), T24 (ATCC HTB 4), VERO (ATCC CCL81) and 
D17 (ATCC CCL183) were purchased from ATCC. 

20 HOS, TE671 and Mv-l-Lu cells and their clones harboring 

MFGnlslacZ retroviral vector as described by Takeuchi et 
al., J Virol (1994), 68, 8001-8007. 

The above cell lines were grown in DMEM (Gibco-BRL, U.K.) 
25 supplemented with 10% fetal calf serum. 

EB8 (Battini et al., J. Virol (1992) 66: 1468-1475); 
psiCRE, psiCRELLZ and psiCRIP (Danos et al., Proc. Natl. 
Acad. Sci USA (1988) 85: 6460-6464); 
30 Cells GP+EAM12 (Markowitz et al., Virology (1988), 167, 400- 

406); and 

NIH-3T3 murine fibroblasts. 

These cell lines were grown in DMEM (GIBCO-BRL, U.K.) 
35 supplemented with 10% new-born calf serum. 
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Mv-l-Lu, TE671 and HT1080 cells were transfected using 
calcium-phosphate precipitation method (Sambrook et., 
"Molecular Cloning" 1989, Cold Spring Harbour Laboratory 
Press: New York) as described elsewhere (Battini et al., 
supra) . CeB-transfected Mv-l-Lu, TE671 and HT1080 cells 
were selected with 3, 6-8 and' 4 /xg/ml of blasticidin S (ICN, 
UK) , respectively, and blasticidin-resistant colonies were 
isolated 2-3 weeks later. Cells transfected with the various 
env-express.ion plasmids were selected with phleomycin 
(CAYLA, France) : 50 ng/ml (for FBASALF- transfected cells) or 
10 fig/ml (for FBASAF-, FbdelPASAF- , FbdelPMOSAF , 
FBdelPIOAISAF or FBdelPRDSAF- transfected cells) . Phleomycin- 
resistant colonies were isolated 2-3 weeks later. 

Production of lacZ pseudotypes using replication competent 
viruses, amphotropic murine leukemia virus (MLV-A) 1504 
strain and cat endogenous virus RD114, was carried out as 
described previously (Takeuchi et al., J Virol (1994), 68, 
8001-8007) . 

Example 2 

Preparation of Plasmids. 

The env gene of pCRIP (Danos et al., supra) was excised by 
Hpal/Clal digestion. A 500 bp PCR-generated DNA fragment was 
obtained using pSV2-bsr (Izumi et al., Experimental Cell 
Research (1991), 197, 299-233) as template and a pair of 
oligonucleotides: 

(5 ' >CGGAATTCGGATCCGAGCTCGGCCCAGCCGGCCACCATGAAAACATTTAACATTTC 
TO (SEQ ID NO 2) at 5' end and 

( 5 ' >GATCCATCGATAAGCTTGGTGGTAAAACTTTT) (SEQ ID No 3) at 3 ' 
end, with Sfil and Clal sites, respectively. This fragment 
was inserted in Hpal/Clal sites of pCRIP by co-ligation with 
a 85 bp Hpal/Sfil DNA fragment isolated from pOXEnv (Russell 
et al., Nucleic Acids Research (1993), 21, 1081-1085) which 
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provides the end of the Moloney murine leukemia virus" 
(MoMLV) pol gene. The resulting plasmid named CeB (Fig. 1} 
could express the MoMLV gag-pol gene as well as the bsr 
selectable marker conferring resistance to blasticidin S, 
5 both driven by the MoMLV 5'LTR promoter. 

A series of env-expression plasmids was generated using the 
4070A MLV (amphotropic) env gene (Ott et al.., j Virol 

(1990) , 64, 757-766) and the FB29 Friend MLV promoter 

10 (Perryman et al., Nucleic Acid Res (1991), 19, 6950). In 

FBASALF (Fig. 1) a Bglll/Clal fragment containing the env 
gene was cloned in BamHI/Clal sites of plasmid FB3LPh which 
also contained the C57 Friend MLV LTR driving the expression 
of the phleo selection marker. A 136 bp env fragment was 

15 generated by PCR using plasmid FB3 (Heard et al., j Virol 

(1991) , 65, 4026-4032) as template and a pair of 
oligonucleotides: (5' >GCTCTTCGGACCCTGCATTC) (SEQ ID NO 4) at 
5' end (before Clal site) and 

(5 ' >TAGCATGGCGCCCTATGGCTCGTACTCTATAGGC) (SEQ ID NO 5) at 3' 
end, providing a KasI restriction site immediately after the 
env stop codon. This PCR fragment was digested using Clal 
and KasI. A DNA fragment containing the FB29 LTR and the 
MLV-A env gene was obtained by Ndel/Clal digestion of 
FBASALF. The fragments were co-ligated in Ndel/KasI digested 
pUT626 (kindly provided by Daniel Drocourt, CAYLA labs, 
France). In the resulting plasmid, named FBASAF (Fig. l) , 
the phleo selectable marker was expressed from the same mRNA 
as the env gene. A Bglli restriction site was created after 
the Mscl site at position 214 in the FB29 leader by using a 
30 commercial linker (Biolabs, France) . A Ndel/Bglli fragment 

containing the FB29 LTR was co-inserted with the Bglll/Clal 
env fragment in Ndel/Clal -digested FBASAF plasmid DNA, 
resulting in plasmid FBdelPASAF (Fig. l) . Compared to 
FBASAF, FBdelPASAF has a lOObp larger deletion in the leader 
35 region. 
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Example 3 



Cloning and Sequencing of the RD114 env gene 
The RD114 env gene was first sub-cloned in plasmid 
Bluescript KS+ (Stratagene) as a 3 Kb Hindlll insert 
isolated from SC3C, an RD114 infectious DNA clone (Reeves et 
al., J. Virol (1984), 52, 164-171). A 2.7 kb Seal-Hind III 
fragment of this subclone containing the RD114 env gene was 
sequenced (Figure 4 (SEQ ID NO 1)- EMBL accession number; 
X87829). The 5' non-coding sequence upstream of an Ndel site 
was deleted by an EcoRl/Ndel digestion followed by filling- 
in with Klenow enzyme and self -ligation. From this plasmid, 
two DNA fragments were obtained: a BamHI/Ncol 2.5 Kb 
fragment and a 63 bp PCR-generated DNA fragment using 
(5 ' >CGCCTCATGGCCTTCATTAA) (SEQ OD NO 6) at 5' end (before 
NotI site) and (5 ' >TAGCATGGCGCCTCAATCCTGAGCTTCTTCC) (SEQ ID 
NO 7) at 3' end, providing a KasI restriction site just 
after RD114 env gene stop codon. The PCR fragment was 
digested with Ncol and KasI. Both fragments were co- 
inserted between Bglll and KasI sites of FBdelPASAF and the 
resulting plasmid was named FBdelPRDSAF (Fig. l) . 

Plasmid pCRIPAMgag- (Danos, O. et al., Proc Natl Acad 
Sci USA (1988) 85:6460-6464) was used for transf ection. 

Example 4 
Infection assays. 

Target cells were seeded in 24-multiwell plates (4xl0 4 cells 
per well) and were incubated overnight. Infections were then 
carried out at 37°C by plating 1 ml dilutions of viral 
supematants in the presence of 4 M g/ml polybrene (Sigma) on 
target cells. 3h later virus -containing medium was replaced 
by fresh medium and infected cells were incubated for two 
days before X-gal staining, performed as previously 
described (Tailor et al., J Virol (1993), 67, ' 6737-6741, 
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Takeuchi et al . , J Virol (1994), 68, 8001-8007). Viral 
titers were determined by counting lacZ-positive colonies as 
previously described (Cosset et al. , J. Virol. (1990) 64: 
1070-1078). Stability of lacZ pseudotypes in fresh human' 
serum was examined by titrating surviving virus after 
incubation in 1:1 mixture of virus harvest in serum- free 
medium and fresh human serum for 1 h at 37°C as described 
before (Takeuchi et al. supra) . 



Example 5 

Reverse transcriptase (RT) assay. 

RT assays were performed either as described previously 
(Takeuchi et al . supra) or using an RT assay kit (Boehringer 
Mannheim, U.K.) following the manufacturer's instruction but 
using MnCl 2 (2 mM) instead of MgCl 2 . 

Example 6 



Screening producer cell lines. 

Viral particles generated with RD114 envelopes have been 
found to be more stable in human serum than virions with 
MLV-A envelopes and that the producer cell line also 
controls sensitivity (Takeuchi et al. supra) . A panel of 
cell lines was screened for their ability to produce high 
titer viruses and for the sensitivity of these virions to 
human serum. To do this, cells were infected at high 
multiplicity with lacZ pseudotypes of either MLV-A or RD114 
and cells producing helper-positive lacZ pseudotypes were 
established. Human HT1080 and TE671 and mink Mv-l-Lu cells 
were found to release high titer lacZ(RD114) and lacZ (MLV-A) 
viruses. LacZ (MLV-A) pseudotypes produced by HT1080 cells 
were more resistant to human serum than those produced by 
other cells. The titer of these viruses was only four-fold 
less following a 1 hr incubation with human serum than a 



WO 97/08330 



PCT/GB96/02061 



22 

control incubation (Table 1). LacZ(RD114) pseudotypes" 
produced by human cells or mink Mv-l-Lu cells were in 
general stable in human serum (Table 1) . These results 
suggested that HT1080, TE671 and Mv-l-Lu cells provided the 
best combination of high lacZ titers and resistance to human 
serum and they were therefore used for the generation of 
retroviral packaging cells. 

Table 1. Titer and stability of lacZ pseudotypes. 



Producer 
cell 



LacZ(MLV-A) 



Titer a 



LacZ (RD1 14) 



Stability 1 * Titer a 



Stability 11 



A2 04 


650 


<3 


1,200 


105 • 


HeLa 


9 


nd 


2, 000 


115 


HOS 


4,500 


6 


23,000 


86 


HT1080 


2, 000, 000 


26 


400, 000 


129 


MRC-5 


450 


10 


1,000 


nd 


T24 


350 


nd 


1,200 


nd 


TE671 


15, 000 


2 


90,000 


38 


VERO 


260 


nd 


90 


nd 


D17 


900 


<1 


200,000 


1 


Mv-l-Lu 


80,000 


1 


200,000 


120 



b: % of infectivity of human serum- treated viruses compared to fetal calf serum- 
treated viruses 



Example 7 



Construction of an improved gag-pol expression vector. 
A MoMLV gag-pol expression plasmid, CeB (Fig. i) , was 
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derived from pCRIP (Danos et al., Proc. Natl. Acad Aci USA 
(1988) 85: 6460-6464). Approximately 2 Kb of env sequence 
were removed from pCRIP and the bsr selectable marker, 
conferring resistance to blasticidin S (Izumi et al., 
Experimental Cell Research (1991) 197:229-233), was inserted 
74 nts downstream of the gag-pol gene. This 74 nts interval 
had no ATG triplets and was thought to provide an optimal 
distance between the stop codon of the pol reading frame and 
the start codon of the bsr gene to allow re -initiation of 
translation (Kozak Mol Cell Biol., 1987, 7: 3438-3445). 
There was no "Kozak" consensus sequence (Kozak Cell, (1986) 
44: 233-292) at the 5' end of the marker gene. Therefore, 
bsr could only be expressed by re -initiation of translation 
after the upstream gag-pol gene had been expressed. 
Consequently, after transfection of CeB in Mv-1- 
Lu/MFGnlsLacZ (ML) , TE67l/MFGnlsLacZ (TEL) or HT1080 cells, 
blasticidin S -resistant bulk populations and most cell 
clones expressed high levels of gag-pol proteins assessed by 
the reverse-transcriptase (RT) activity found in cell 
supernatants (Table 2). Considerably higher RT activities 
were found in bulk populations of CeB-transf ected ML cells 
compared to bulk population of ML cells stably transf ected 
with the parental pCRIP construct. Similarly the RT 
activities of two packaging cell lines generated using 
pCRIPenv- construct, psiCRE cells (Danos et al., supra) and 
EB8 cells (Battini supra.) were less than that of CeB 
transf ected clones (Table 2). Finally, RT.activitiy in CeB 
transfected cell supernatants was higher than that of cells 
chronically infected by replication- competent MLV-A (Table 
2) . 

Table 2. Secreted reverse transcriptase expression 
Cell 8 RT activity" LacZ Titer c 
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ML/MLV-A 


1 


8xl0 4 


MLSvR 


0 . 1 


<1 


MLCRTP /Knll.\ 
iiUUI\.X±r IDUIK) 


0 . 15 


nd 


JYUjLeB (bulk) 


1 . 7 


nd 


i v ii_iCeo J. 


4 . 2 


1x10 s 




1. 6 


1x10 s 


I.fci.b/MLV-A 


3 . 6 


2xl0 6 




5 . 2 


4xl0 7 


niiuou/r'iijV - A 


1 . 1 


lxlO 6 


HTCeBo 


1.9 


1x10 6 


HTCeB18 


2.7 


2x10* 


HTCeB22 (FLY) 


6.9 


5xl0 6 


HTCeB4 8 


5.5 


3xl0 6 


EB8 


0.22 


lxlO 4 


psiCRE-LLZ 


1.2 


lxl0 5d 



a: ML, Mv-l-Lu cells harboring a MFGnlslacZ provirus; TEL, TE671 cells harboring 
a MFGnlslacZ provirus; /MLV-A, cells chronically infected- with MLV-A 1S04 
strain; MLSvB, ML cells transfected with a plasmid pSV2bsr alone; MLCRIP, ML 
cells co-transfected with pCRIP and pSV2bsr. 

b: Average of arbitrary units relative to ML/MLV-A RT activity of at least two 
independent experiments was shown. The standard errors did not exceed 20 % of 
the values. 

c: titration on TE671 cells as lacZ i.u./ml. After polyclonal transfection of a 
plasmid which expresses MLV-A env in MLCeB clones, TELCeB clones, HTCeB clones 
and EB8 cells; nd, not done, 
d: titration on NIH3T3 cells 

To rescue infectious lacZ viruses, MLCeB and TELCeB clones 
were transfected with FBASALF DNA, a plasmid designed to 
express the MLV-A env gene (Fig. l) . Bulk populations of 
stable FBASALF transf ectants were isolated and supernatants 
were titrated using TE671 cells as targets. Titers of lacZ 
viruses were higher than either MLV-A infected ML or TEL 
cells, or FBASALF-transfected EB8 cells (Table 2). These 
data suggested that CeB was an extremely efficient MLV gag- 
pol expression vector in mink Mv-l-Lu and TE671 cells. CeB 
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was therefore used to derive packaging cells by transaction 
of HT1080 cells. 41/49 blasticidin S-resistant colonies had 
detectable levels of RT; 9 had RT activity higher than that 
of control MLV-A- infected HT1080 cells (data not shown) . 
Expression of gag precursor was confirmed in cell lysates 
and supernatants of these 9 HTCeB clones by immunoblotting 
using antibodies against p30-CA (data not shown) The 4 
clones with the highest expression of gag proteins (clones 
6,18,22 and 48) were infected at high-multiplicity with 
helper free, lacZ pseudotypes bearing MLV-A envelopes 
(MFGnlslacZ (A) ) produced by TELCeB 6/ FBASALF (Table 3) and 
then transfected with FBASALF. Supernatants of bulk, 
phleomycin-resistant transf ectants were assessed for RT 
activity and lacZ titer (Table 2) . Clone HTCeB 2 2 , named FLY, 
was found to be the best gag-pol producer clone and was used 
to introduce env expression vectors for the generation of 
packaging cell lines. 
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Table 3. Titer following env construct transfection 



Producer cell 



Env source 



Titer* 



psiCRIP lacZ 5 
GP+EAM12 lacZ 25 
TELCeB6 

TELCeB6 



FLY* 1 



FLYA4 lacZ 3 
FLY* 1 



pCRIPAMgag- 

envAM 

FBASALF C 

FBASAF C 

FbdelPASAF c 

FBdelPASAF 1 
FbdelPASAF 4 
FbdelPASAF 6 
FbdelPASAF 7 
FbdelPASAF 8 
Fbde 1 PRDSAF 2 
FbdelPRDSAF 4 
Fbde 1 PRDSAF 7 
FbdelPRDSAF 8 

FBdelPASAF 1 

FbdelPASAF 4 

FbdelPASAF 5 

FbdelPASAF 7 

FbdelPASAF 13 

FbdelPASAF 14 

FbdelPASAF 15 

FbdelPASAF 16 

FbdelPASAF 17 

FBdelPASAF 4 

FBdel PRDSAF 1 
FbdelPRDSAF 2 
FbdelPRDSAF 6 
FbdelPRDSAF 10 
FbdelPRDSAF 11 
FbdelPRDSAF 13 
FbdelPRDSAF 17 
FbdelPRDSAF 18 
FbdelPRDSAF 19 



6x1 0 413 

3x1 0 5b 

5xl0 7 
2xl0 7 
2xl0 7 



3xl0 7 
2xl0 7 
IxlO 7 
5xl0 7 
IxlO 7 
IxlO 6 
3x10 s 
IxlO 7 
2xl0 6 

lxl 0 1 

1.5x10 s 

IxlO 6 

IxlO 6 

7xl0 6 

4xl0 6 

IxlO 6 

5xl0 6 

6xl0 6 

2xl0 7b 

2.5xl0 6 

IxlO 7 

5x10 6 

2xl0 6 

3xl0 6 

IxlO 6 

5xl0 6 

3xl0 7 

6xl0 6 



Average titers of at least three independent experiments were shown. The 

standard errors did not exceed 30 % of the titer values. 

a: titrated on TE671 cells as lacZ i.u./ml 

b: results of best MFGnlslacZ producer clones. 
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c: bulk populations of env- transf ectants in TELCeBS cells. 

d: titration after bulk infection with helper- free MFGnlslacZ. 

Example 8 

Construction of env expression vectors. 
A series of MLV-A env expression plasmids were then 
generated (Fig. 1) . m FBASALF, the env gene was inserted 
between two Friend-MLV. LTRs, its expression driven by the 
FB29 MLV LTR (Perryman et al., supra) . Most of the packaging 
signal located in the leader region was deleted. This 
plasmid also expressed the phleo selectable marker (Gatignol 
et al., supra) driven by the 3' LTR. FBASAF and FBdelPASAF 
were then designed following the same strategy used for CeB. 
These two vectors differed only by the extent of deletion of 
the packaging signal, FBdelPASAF having virtually no leader 
sequence. Compared to pCRIPAMgag- and pCRIPgag-2 env 
plasmids expressed in psiCRIP or psiCRE packaging cells 
(Danos et al., supra) about 5 Kb of gag-pol sequences was 
removed. In addition the 258 bp retroviral sequence 
containing the end of env gene and the begining of U3 found 
in pCRIPAMgag- and pCRIPgag-2 was also removed. For both 
FBASAF and FBdelPASAF plasmids, the phleo selectable marker 
was inserted downstream of the env gene by positioning a 76 
nts linker with no ATG codons between .the two open-reading 
frames. Phleo could therefore only be expressed by re- 
initiation of translation by the same ribosomal unit that 
had expressed the upstream env open reading frame. 
FBdelPASAF was also used to generate FBde 1 PRDSAF , an RD114 
envelope expression plasmid (Fig. 1) . 

After transfection of the env plasmids into TELCeB6 cells 
(Table 2) , bulk populations of phleomycin-resistant colonies 
were isolated and their production of lacZ virus measured 
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(Table 3) . FBASALF gave a titer of 5xl0 7 lacZ-i . u . /ml / whilst 
titers with either FBASAF or FBdelPASAF were 2xl0 7 lacZ- 
i.u./ml (Table 3). Titers of 5xl0 7 or 10 7 lacZ-i .u. /ml could 
be obtained with some FBdelPASAF cell clones or FBdelPRDSAF 
clones, respectively. 

As FBdelPASAF has minimal virus -derived sequences and was 
shown to be the safest construct (see below and Table 4), 
it and FBdelPRDSAF were used to generate packaging lines' 
from FLY cells (clone HTCeB22 , Table 2). Envelope expression 
of these clones was assayed by interference to challenge 
with MFGnlslacZ (A) or MFGnlslacZ (RD) pseudotypes produced by 
TELCeB6/FBdelPASAF-7 or TELCeB6/FBdelPRDSAF-7 , respectively 
(Table 3). The cell lines showing most interference were 
cross-infected at high multiplicity with these pseudotypes 
to provide MFGnlslacZ proviruses, and supernatants were then 
titrated on TE671 cells (Table 3) . FLY- FBdelPASAF- 13 (FLYA13 
packaging line) and FLY - FB de 1 PRDS AF - 1 8 (FLYRD18 packaging 
line) gave the highest productions of lacZ viruses, around 
10 7 lacZ-i.u./ml. The best MFGnlslacZ producer clones derived 
from either psiCRIP cells (Danos et al., supra) or GP+EAM12 
cells (Markowitz et al., supra) gave approximately 50 fold 
lower titers (Table 3). The lacZ titers of the FLY-derived 
lines shown in Table 3 are lower than the best TELCeB6- 
derived lines after transfection of either FBdelPASAF or 
FBdelPRDSAF (Table 3) . However it should be noted that the 
lacZ provirus expressed in TELCeB6 cells was obtained after 
clonal selection but was introduced polyclonally in FLY- 
derived env-transfected cell clones. When FLY -FBdelPASAF -4 
cells (FLYA4 packaging line) , infected with helper-free 
MFGnlslacZ (RD) , were cloned by limiting dilution the best 
clones (eg. FLYA4lacZ3) were found to produce 20 times more 
infectious viruses than the bulk population, reaching the 
range of titers obtained with the best TELCeB 6 -FBdelPASAF 
clones (Table 3) . 
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Example 9 

Assays for transfer of gag-pol or env functions. 
To assay for replication-competent viruses, supernatants 
were used to infect TEL cells (a clone of TE671 cells 
harboring an MFGnlslacZ provirus) . Infected cells were 
passaged for 6 days or longer and their supernatants were 
used for infection of fresh TEG 71 cells. No transmission of 
lacZ viruses could be detected (Table 4), demonstrating that 
the supernatants of pCRIPAMgag-- , FBASALF-, FBASAF-, or 
FBdelPASAF-transfected TELCeBG cells were helper-free. 
Similar absence of replication competent recombinant 
retroviruses was demonstrated using supernatant from a 
clone of psiCRIP-MFGnlslacZ cells or from two clones of 
FLYA-MFGnlslacZ cells (Table 4) . 

There have been reports that helper- free retroviral vector 
stocks may nevertheless contain recombinant retroviruses 
(replication incompetent) carrying either gag-pol or env 
genes (Bestwick et al. f Proc Natl Acad Sci USA (1988), 85, 
5404-5408, Cosset et al., Virology (1993), 193, 385-395, 
Girod et al., Virology (1995), in press) . To assay for such 
recombinant retroviruses, mobilisation of an MFGnlslacZ 
provirus from two indicator cell lines which could cross - 
complement potential recombinant viruses carrying either 
gag-pol or env functional genes was attempted. The TELCeBS 
line (Table 2) expressing gag-pol proteins was used as 
indicator cell line to test for the presence of env 
recombinant (ER) viruses. The TELMOSAF indicator line 
expressing MoMLV env glycoproteins (obtained by transfection 
of FBMOSAF, a plasmid expressing the MoMLV env, gene using 
FBASAF backbone, in TEL cells) was used to detect the 
presence of gag-pol recombinant retroviruses (GPR viruses) . 
After passaging 4-8 days, the supernatants of the infected 
indicator cells were used to infect either human TE671 cells 
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or murine NIH3T3 cells. 

TELCeB6 cells transfected with various env- expressing 
constructs, pCRIPAMgag-, FBASAF and FBdelPASAF were 
compared. Although the supernatants of TELCeB 6 -FBdelPASAF 
cells were devoid of replication-competent retroviruses, 
they were found sporadically to transfer gag-pol genomes 
(Table 4) . No GPR viruses could be detected when less than 
2xl0 5 virions were used to infect the indicator cells. 
Similarly TELCeB 6 indicator cells infected with various 
helper- free viruses were shown sporadically to release lacZ 
virions (Table 4) . The number depended both on the env- 
expression vector used and on the virus input quantity. 
Compared to lacZ viruses generated using pCRIPAMgag- 
plasmid, the frequency of detection of the env- recombinant 
viruses was lower for supernatants generated by using FBASAF 
and FBdelPASAF constructs (Table 4) For FBdelPASAF 
construct when less than 5x10 s MFGnlslacZ (A) helper-free 
virions were used to infect the indicator cells, no ER 
retroviruses could be detected. From these experiments, it 
could be estimated that a supernatant, produced from 
TELCeBG -FBdelPASAF cells, containing lxlO 7 infectious units 
of MFGnlslacZ retroviral vector contained no replication- 
competent virus, and about 100 gag-pol and 100 env 
recombinant retroviruses . 
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Table 4. Transfer of packaging function 



Producer cell 



25 



30 



35 



TELCeB6-pCRIPAMgag- 
TELCeB6-pCRIPAMgag- 
TELCeB6-pCPJPAMgag- 

TELCeB6-FBASAF 
TELCeB6-FBASAF 
TELCeB6-FBASAF 

TELCeB6-FBdelPASAF 
TELCeB6-FBdelPASAF 
TFJLCeB6-FBdelPASAF 



Indicator cell Input vims' Detection^" 



(lacZ-i.u.) + + + 









ReDlication cnmnctpnt vimc 








psiCRIP lacZ 5 


TEL 


2X10 4 


0/4 


0/4 


.4/4 


10 


TELCeB6-pCRIPAMgag- 


TEL 


5xl0 6 


0/4 


0/4 


4/4 


TELCeB6-FBASAF 


TEL 


5xl0 5 


0/4 


0/4 


4/4 




TELCeB6-FBdelPASAF 


TEL 


5xl0 6 


0/4 


0/4 


4/4 


15 


FLYA4 lacZ 3 
FLYA4 lacZ 7 


TEL 
TEL 


lxlO 7 
lxlO 7 


0/4 
0/4 


0/4 
0/4 


4/4 
4/4 








Gac-DOl recombinant 








20 


TELCeB6-FBdelPASAF 7 
TELCeBo-FBdelPASAF 7 
TELCeB6-FBdelPASAF 7 
TELCeB6-FBdelPASAF 7 


TELMOSAF 2xl0 7 
TELMOSAF 2xl0 6 
TELMOSAF 2x10 s 
TELMOSAF 2x10* 


0/4 
0/4 
0/4 
0/4 


1/4 
2/4 
2/4 
0/4 


3/4 
2/4 
2/4 
4/4 



Env recomhinent 



TELCeB6 


5xl0 6 


2/4 


1/4 


1/4 


TELCeB6 


5x10 s 


1/4 


1/4 


2/4 


TELCeB6 


5xl0 4 


0/4 


2/4 


2/4 


TELCeB6 


5xl0 6 


0/4 


2/4 


2/4 


TELCeB6 


5x10 s 


0/4 


1/4 


3/4 


TELCeB6 


5X10 4 


0/4 


1/4 


3/4 


TELCeB6 


5xl0 6 


0/4 


1/4 


3/4 


TELCeB6 


5x10 s 


1/4 


3/4 


0/4 


TELCeB6 


5xl0 4 


0/4 


0/4 


4/4 



40 



a: number of lacZ i.u. used to infect indicator cells 

b: number of incidence out of four experiments. The ranges of lacZ titers 
rescued from infected indicator cells are shown for each virus input: >100 
lacZ i.u. /ml (++), l-lOO lacZ i.u. /ml (+) and <i lacZ i.u. /ml (-). 



Titers were determined on TE671 cells for replication 
competent virus and env recombinant and NIH3T3 cells for 
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gag-pol recombinant. 



Example 10 

In order to confirm resistance to complement and absence of 
replication competent virus in our best packaging lines, 
MFGnlslacZ (A) and (RD) harvested from FLYA13 and FLYRD18, 
respectively, after polyclonal transduction of MFGnlslacZ 
(Table 3 above) were tested for stability in fresh human 
serum and generation of replication competent virus. Titers 
of MFGnlslacZ (RD) from FLYRD18 after 1 hr incubation with 3 
independent samples of fresh human serum were 80 to 120 % 
of control incubations, while titers of MFGnlslacZ (A) from 
FLYA13 were 50 to 90 % of controls (data not shown) . No 
replication competent virus was detected in the same assay 
described above (Table 4) when 1 x 10 7 i.u. each of 
MFGnlslacZ (A) and (RD) were tested. 

EXAMPLE 11. 

Generation of plasmids. 

CeB plasmid (Fig. 5) expressing MoMLV gag-pol gene, was 
further modified to remove the splice donor site located in 
the leader region. A 272 bp fragment was PCR-generated by 
using OUSD- (5 ' -TCTCGCTTCTGTTCGCGCGC) and OLSD- 
(5 ' -TCGATCAAGCTTGCGGCCGCGGTGGTGGGTCGGTGGTCC) as primers and 
further digested with BssHII and Hindlll. A 1008 bp 
Hindlll-Xhol fragment isolated from CeB (encompassing a part 
of leader sequence and beginning MoMLV gag) and the PCR 
fragment were co-inserted into pCeB from which the 1275 bp 
BssHII -Xhol fragment (encompassing R-U5- leader -gag) had been 
removed. The resulting- plasmid, named pCeB DS- (Fig. 5), 
beared the deletion of splice donor (SD) site and a NotI 
restriction site created just downstream to the lost SD 
site. 
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A series of gag-pol expression plasmids in which the MoMLV 
LTR promoter was replaced by the human cytomegalovirus 
immediate early promoter (hCMV promoter) was derived from 
both CeB DS- and hCMV-G (Yee et al . , 1994 PNAS, 91: 9564- 
5 9568) , a plasmid used as a source for the hCMV promoter. A 

Notl-filled/EcoRI 7260 bp fragment was isolated from CeB DS- 
and cloned into hCMV-G which had been opened with Sail 
(further rendered blunt-ended) and EcoRI to remove the VSV-G 
gene. The resulting plasmid was cutted with Clal and EcoRI 

10 to remove a 1155 bp fragment encompassing sequence derived 
from 3' -LTR and SV40 polyA sequence and self-ligated after 
filling both protruding DNA ends. The resulting plasmid, 
named phCMV-intron (Fig. 5), had gag-pol and bsr ORFs 
inserted between the CMV promoter and rabbit beta-globin 

15 polyA post -transcriptional regulatory sequences. 

An intermediate plasmid was generated by sub-cloning a 7260 
bp EcoRI fragment (isolated from CeB DS- ) into hCMVG opened 
with EcoRI. A 1155 bp fragment (encompassing sequence 

20 derived from 3' -LTR and SV40 polyA sequence) was removed 
from this intermediate plasmid which was then 
re -circularized by self ligation after filling both ends. 
The resulting plasmid, named phCMV+intron 2P (Fig. 5), was 
digested with NotI and the vector was treated with klenow 

25 enzyme, A 1440 bp fragment (encompassing hCMV promoter and 

rabbit beta-1 globin intron B (Rohrbaugh et al., 1985 Mol. 
Cell Biol, 5: 147-160)) was isolated from phCMV+ intron 2P by 
Notl/EcoRI digestion. This fragment was further treated with 
klenow enzyme and ligated back into the vector. The 

30 resulting plasmid, named hCMV+ intron (Fig. 5.), could express 
gag-pol and bsr genes driven by the hCMV promoter and beared 
an intron sequence derived from rabbit beta-l globin intron 
B having both SD and SA (splice acceptable) sites. 



35 



A 2450 bp fragment was removed from phCMV+intron 2P by 



WO 97/08330 



PCT/GB96/02061 



34 



10 



15 



25 



30 



Notl/Xhol digestion. The resulting vector fragment was then 
used to co-ligate a 1330 bp fragment (containing hCMV 
promoter + 5' end of rabbit beta-1 globin intron B (with SD 
site)) isolated from phCMVG by Apal-f illed/Notl digestion 
and a 1 kb fragment isolated from phCMV+intron 2P by 
Notl-filled/XhoI digestion. Compared to phCMV+intron 2P, the 
resulting plasmid, named hCMV+SD intron (Fig. 5) , had the 
deletion of the 3' end of the rabbit beta-l globin intron B 
and thus no SA site in the leader region. 

Construct phCMV+leader (Fig. 5) has been described elsewhere 
(Savard et al., unpublished). This plasmid, in which gag-pol 
and bsr genes were driven by the hCMV promoter, had the 
MoMLV SD site in the leader region. 



Gag-pol expression. 

The different constructs, including the parental CeB 
plasmid, were analysed comparatively in a complementation 
assay after transfection in TEL - FBde 1 PASAF cells expressing 
20 4070A-MLV (amphotropic) envelope and harboring a MFGnlslacZ 

provirus. The transient production of lacZ retroviruses as 
well as the stable production of lacZ retroviral vectors 
after selection with blasticidin S were determined (Table 
5) . All the constructs were able to rescue infectious lacZ 
retroviruses indicating the expression of gag-pol proteins 
after transient transfection. Most likely due to the 
efficient hCMV and rabbit beta-1 globin intron B 
(post) -transcriptional regulatory sequences, hCMV+intron was 
particularly potent in transient retroviral vector 
production. However, 10 times less blasticidin-resistant 
colonies were obtained with hCMV+intron comparatively to 
CeB, and stable lacZ virus production from hCMV+intron was 
about 5-10 times lower than that of CeB. Clonal examination 
of lacZ retrovirus production from blasticidin-resistant 
35 colonies indicated that 80-90% of colonies could express 
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high levels of gag-pol proteins for both hCMV+intron and CeB 
plasmids. In contrast, despite variation in their ability to 
form blasticidin-resistant colonies after transfection and 
despite their ability to express gag-pol proteins from 
transient transf ectants , all other constructs had a weak 
capacity for rescuing lacZ retroviral vectors from stable 
transfectants (Table 5) . 



Table 5. Comparative study of gag-pol-bsr plasmids. 



gag-pol -bsr 
plasmid 


Transient 
(lacZ 
i .u. /ml) 


no clones 
bsr* 


Stable 
(lacZ 
i . u . /ml 


% gag-pol 
/bsr 


Ceb 


300/ml 


50 


10 7 


90% 


Ceb DS- 


144/ml 


5 


10 s 


50% 


hCMV+intron 
2P 


ND 


20 


10 6 


50% ; 


hCMV- intron 


8 12 /ml 


0 






hCMV+SD 
intron 


15 0 /ml 


1000 


10 2 


nd 


hCMV+leader 


328/ml 


1000 


10 2 -10 3 


nd 


hCMV+intron 


12000/ml 


5 


10 6 -10 7 


80% 



Northern blot analyses were performed on stable 
transfectants (blasticidin-resistant) obtained with some of 
the gag-pol-bsr plasmids. As expected, the results (not 
shown) displayed a correlation between expression of gag-pol 
mRNAs and gag-pol protein expression detected by rescue 
analysis (Table 5) . CeB construct was found to produce 2-3 
fold more gag-pol mRNAs compared to hCMV+ intron. 
Interestingly, an unexpected 2.45 kb RNA band was found for 
hCMV+ intron construct at a ratio of 2:1 compared to the 
abundancy of the gag-pol mRNA band (at 5.95 kb) . Further 
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investigations by using other probes revealed that a cryptic 
splice donnor (SD) site located in the gag gene (right in 
the middle of the CA coding region at position 1596-1597 
-numbering according to Shinnick et al., 1981 Nature 
(London) 293: 543-548) was activated in this latter 
construct. The 2.45 RNA species, lacking the 3' half of the 
gag gene and most of the pol gene, is unlikely to give rise 
to any useful translational product. It is therefore 
interesting to notice that hCMV+intron construct was able to 
give rise to slightly more transcripts (gag-pol 5.95 mRNA + 
2.45 alternative RNA band) compared to gag-pol mRNA 
expressed from CeB construct. Therefore we decided to 
inactivate the cryptic SD site in the hCMV+intron construct 
in order to increase the ratio of gag-pol mRNAs. 

Assays for transfer of gag-pol functions. 

Although the supernatants of pacakaging cell lines generated 
with CeB gag-pol expression contruct were devoid of 
replication-competent retroviruses, they were found 
sporadically to transfer gag-pol genomes (example 9, Table 

4) (Cosset et al., 1995 J. Virol 69: 7430-7436). Because 
gag-pol -bsr constructs generated here by using the hCMV 
promoter had much less retroviral sequences homologous to 
the retroviral vector than the parental CeB construct (Fig. 

5) , they are less likely to give rise to gag-pol recombinant 
(GPR) viruses. Therefore, the most efficient gag-pol-bsr 
plasmids, hCMV+intron and CeB, were further analysed for 
emergence of GPR viruses. To assay for such recombinant 
retroviruses, we attempted to mobilise an lacZ provirus from 
an indicator cell lines which could cross -complement 
potential recombinant viruses carrying gag-pol functional 
genes. Results displayed in Table 6 showed that consistently 
with data reported previously (example 9, Table 4) (Cosset et 
al., 1995 Supra), lacZ retrovirus vectors generated by using 
CeB gag-pol construct were contaminated with GPR viruses. In 
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contrast lacZ retrovirus vectors generated by using 
hCMV+intron construct were completely devoid of such GPR 
viruses, suggesting that this construct was 'improved 
compared to CeB with respects with emergence of recombinant 
5 viruses. 



Table 6. Comparative study of gag-pol-bsr plasmids. 



10 



plasmid 


input virus 
(lacZ i.u.) a 


no of experiments 
giving titres of b 


CeB 


5x10 s 


5 


3 


0 




5x10 s 


2 


4 


2 




5x1 0 4 


0 


1 


7 


hCMV+intron 


5xl0 6 


0 


0 


8 




5x1 0 s 


0 


0 


8 




5xl0 4 


0 


0 


8 



4X10E4 cells of TEL/MOSAF in 24 wells were challenged with lacZ (A) of i.u. 
indicated in the table (a), and incubated at 37°C for 3 days. Cells were 
trypsinized and transferred into small flasks. Cell sup was harvested on day 5 
after lacZ (A) challenge and plated on either TE571 (not shown) and 3T3 cells 
(b) . No lacZ was mobilized into TE671 at all. LacZ (A) from CMV-int 10 again 
did not rescue lacZ from TEL/MOSAF. 

Example 12 

Generic primers to detect D-type (Medstrand and Blomberg 
J. Virol. (1993) 67:6778-6787) , C-type (Shih et al . , J 
Virol. (1989) 63:64-75), human endogenous virus RTVL-H 
(Wilkinson et al., J.Virol. (1993) 67:2981-2989), by RT-PCR 
were employed (Patience et al., supra). Primers to detect 
mouse endogenous VL30 element (Adams et al Mol .Cel .Biol . 
(1988) 8:2989-2998), and MFGnlslacZ RNA were designed and 
synthesized (TABLE X) . Overnight supernatants (in 4ml of 
culture medium) from 106 cells of GP+EAM12lacZ25 , FLYA4lac23 
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and TELCeB 6 FBASALF cells (Table 3) were harvested and" 
centrifuged in sucrose gradient as described previously 
(Patience et al. , J.Virol., 70:2654-2657). Fractions 
containing retrovirus particles were collected, and RNA 
extracted. One twentieth of the RNA preparation or 
dilution's thereof were applied to RT-PCR as described 
previously (Table X) . a 1/200 of RNA harvested from 
GP+EAM121acZ25 cells was positive for VL30 RNA. MFGnlslacZ 
RNA was found from 1/20 of RNA from GP+EAMl21acZ and 
TELCeB 6 FBASALF cells and 1/200 of RNA from FI,YA41acZ3 cells. 
The primer combinations for RTVL-H, C- and D-type RNA did 
not give detectable PCR product. 

Table 7. RT-PCR detection of endogenous retrovirus RNA 
associated with virus particles. 



rt-pcr of virion associated RNA from* 



RNA 



primer (5' -3') 
forward (F) /reverse (R) 



GP+EAM12 
lacZ25 



FLYA4 
lacZ3 



TELCeB 6 F 
BASALF 



MFGnls F) CTCTGGCTCACAGTACGACGTAG + 

lacZ R) CCATCAATCCGGTAGGTTTTCCG 



C- type F) CARRGKTTCAARAACWS YCCCAC 

R) AGYARVGTAGCNGGGTTHAGG 

D - type F) TCCCCTTGGAATACTCCTGTTTTYGT 

R) CATTCCTTGTGGTAAAACTTTCCAYTG 



RTVL-H F) CCTCACCCTGATCACRYTTG 

R) GAATTATGTCTGACAGAAGGG 



NT 



VL30 



F) GTTGACATCTGCAGAGAAAGACC ++ 
R) TCTGAGGTCTGTACACACAATGG 



NT 



NT 
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a:-, not detected; + detected in 1/20 RNA preparation; ++ detected in 1/200 RNA 
preparation; NT, not tested because the cells do not possess the corresponding 



genes 
EXAMPLE 13 



Generation of gag-pol pre-packaging cells by using TE671 
cells . 

CeB, a plasmid designed to over-express MoMLV gag and pol 
proteins was introduced in TE671 human rhabdomyosarcoma 
cells (ATCC CRL8805) . After selection with blasticidin, 50 
bsr-positive colonies were isolated and the RT (reverse 
transcriptase) activity was analysed in their supernatants . 
12 TE671-CeB (TECeB) clones with high RT activity were 
selected for further analysis. The best TECeB clone, clone 
#15, had a RT activity roughly equivalent to that TELCeB6 
cells (Cosset et al., J. Virol. 69:7430-7436 (1995); see 
also Example 7, Table 6 in this patent application) but 
displayed 2-3 fold more gag-precursors into cells as 
demonstrated in immunoblots by using anti-CA antibodies. The 
biological activity of gag-pol proteins expressed in the six 
best TECeB clones was further confirmed by their ability to 
produce infectious retroviruses in a complementation assay. 
A lacZ provirus was introduced into each of the TECeB clones 
by polyclonal cross-infection by using lacZ(RD114) helper- 
free retrovirus vectors. FBMOSALF, a MoMLV env expression 
plasmid (Cosset et al., J. Virol. 69:6314-6322), was then 
transfected in each of the TECeB-lacZ lines and in the 
TELCeB6 cell line for comparison. After selection with 
phleomycin, the titer of lacZ retrovirus vectors was 
determined in the supernantant of pools of phleomycin- 
resistant colonies for each TECEB - 1 a c Z - FBMOSALF lines. A 
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good correlation was found between gag-pol expression" into 
the TE-CeB clones (as determined by RT-assays and anti-gag 
immunoblots) and their ability to release infectious lacZ 
particles. TE-CeB15 cells could release approximately the 
same number of lacZ particles when compared to TELCeB6 cells 
although TELCeB6 cells had the advantage of being selected 
for lacZ expression (Cosset et al., J. Virol. 69:7430-7436 
(1995)). TE-CeB15 cells were therefore used to derive 
retroviral packaging cell lines. 

Construction of env-eacpression plasmids. 

A series of plasmid (Fig. 3) was designed to allow 
expression of different retroviral envelope genes (isolated 
from MoMLV, GALV -Gibbon Ape Leukemia Virus-, and MLV-10A1) . 
FBdelPMOSAF (Fig. 3, nucleotide sequence in -Fig. 10) and 
FBdelPlOAlSAF, expressing ecotropic MoMLV or MLV-10A1 
envelopes, were generated by replacing the Bglll/Clal 
fragment from FBdelPASAF (Cosset et al., J. Virol. 69:7430- 
7436 (1995); see also Example 7, Fig. 2 and nucleotide 
sequence in Fig. 9) encompassing most of the env gene and 
splice acceptor site with that of MoMLV (position 5407 to 
7679, Shinnik et al., 1981) or with that of MLV-10A1 (Ott et 
al., J. Virol. 64:757-766 (1990)). 

Nucleotides 7514-7516 of GALV (Delassus et al., Virology 
173:205-213 (1989)) were mutated by PCR-mediated mutagenesis 
to create a Clal site (AAG to CGA) , thereby introducing a 
conservative modification (a lysine (amino-acid 665 of GALV 
env precursor) to an arginine) . The BamHI/Clal fragment (nts 
4994 (Delassus et al. Virology 173:205-213 (-1989)) to 7517) 
was then sub-cloned into FBdelPASAF in which the Bglll/Clal 
encompassing most of the env gene and splice acceptor site 
had been removed. The resulting plasmid, expressing GALV 
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envelope glycoproteins, was named FBdelPGASAF (Fig. 3, 
nucleotide sequence in Fig. 11) . 

CMV10A1 was generated by inserting a Klenow enzyme-filled 
Eagl/Sall fragment from FBdelPlOAlSAF (encompassing 10A1 MLV 
env gene and phleo selectable marker) into hCMV-G digested 
.with BamHI and filled with Klenow enzyme. The resulting 
plasmid, CMV10A1 (Fig. 3 and nucleotide sequence in Fig. 13) 
could express 10A1 envelopes under control of the hCMV 
promoter and the phleo selectable marker by translation re- 
initiation. 

Generation of a multi-tropic set of TE671-based retroviral 
packaging lines. 

FBdelPRDSAF (Fig. 3, nucleotide sequence in Fig. 12), 
FBdelPASAF, FBdelPGASAF, FBdelPMOSAF and FBdelPlOAlSAF were 
independently introduced into cells of the TE-CeB15 pre- 
packaging line, expressing MoMLV gag-pol proteins. 
Transfected cells were phleomycin-selected and 15-20 phleo- ■ 
resistant colonies were isolated for each env-expression 
plasmid transfected. 

Individual colonies were then analysed for expression of 
envelope glycoproteins by immunoblots on cell lysates by 
using antibodies against RD114 SU glycoproteins or against 
Rausher leukemia virus SU (to screen MoMLV, MLV-4070A and 
MLV-10A1 env-producer clones) or against GALV. The best env- 
producer colonies as determined in this assay were further 
analysed by a complementation assay after introducing a lacZ 
retroviral vector. LacZ pseudotypes released from the 
different packaging cell lines were titrated by using NIH 
3T3 cells or TE671 cells as target. Titers higher than lxlO 7 
lacZ i.u./ml were obtained for the best clones. Depending on 
the envelope specificities expressed in these cells, the new 
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TE671-based retroviral packaging cell lines were named TE- 
FLYE, TE-FLYA, TE-FLYRD, TE-FLY10A1, and TE-FLYGA and could 
express the MoMLV, MLV-4070A, RD114, MLV-10A1, and GALV env 
genes, respectively. 

Assays for detecting replication-competent retroviruses 
(RCRs) were performed in the supernatants of these cells and 
were negative (less than 1/ml) . 

TE671 cells are very potent for transient expression 
resulting in more than 95% of cells expressing transgene 
three days after plasmid transfection (Hatziioannou and 
Cosset, unpublished data,. (1996)). The ability of retroviral 
packaging cell lines to transiently produce retroviral 
vectors is of crucial importance for gene therapy where 
vectors carrying toxic gene have to be prepared. Transient 
expression of retroviral vectors was comparatively 
determined from cells of the TE-FLYA line and from the BING 
line (Pear et al., Proc Natl Acad Sci U S A 90, 8392-6 
(1993) ) , a retroviral packaging cell line designed to 
transiently express retroviral vectors. Results (Table 8) 
showed that "TE-FLYA cells were more efficient for transient, 
expression of a lacZ retroviral vector hence resulting in 
higher titers. 



Table 8. Comparative study of transient production of lacZ 
vectors . 



packaging 
cell line 


cell number* 


% transfected 
cells" 


transient 
titer 6 


BING 


281 


5.3 


2xl0 2 


TE-FLYA 

Cells were fMn«fo/>fft 


117 


35 


1.3xl0 3 



precipitation method and titers of of lacZ vectors (c) released in cell 
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supernatant were determined as lacZ i.u./ml at day 3 following transection. The 
relative number of cells (a) (average per microscope field) and the % of 
transfected cells (b) determined after X-gal staining are shown. 

Retroviral vectors prepared from TE671-based packaging cell 
lines were analysed for their sensitivity to human- 
complement mediated inactivation. Experiments were conducted 
as previously described (Cosset et al., J. Virol. 69:7430- 
7436 (1995); see also Example 10 in this patent application) 
by using three human sera of individual donnors (Table 9) . 
As expected MLV-A prepared from mouse 3T3 cells were highly 
sensitive to inactivation after 1 hr incubation witn sera. 
In contrast, titers of lacZ vectors produced from TE-FLYRD 
cells were 17 to 55% of control incubations, while titers of 
lacZ vectors from TE-FLYA cells were 1 to 30% of controls. 



Table 9. Human serum sensitivity of viruses produced from 
TE671-based packaging cell lines. 



Virus from: 


hu56 a 


hu57 a 


BTS a 


3T3/A 


<0.2, <0.2 


<0.2, <0.2 


<0.2, <0.2 


TE-FLYE 


15, 7.8 


16, 11 


48, 60 


TE-FLYA 


1, 0.6 


2.2, 7.1 


28, 19 


TE-FLYRD 

Three human fre.sh nor- 


17, 22 


30, 44 


54, 63 



BTS (AB+) . (a) % control (average for FCS and opt i -MEM treatment) is shown. 
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CLAIMS : 

1. A recombinant expression vector comprising a gene of 
interest and a selectable marker gene, wherein the 
selectable marker gene is arranged downstream of the 
gene of interest and a st6p codon associated with the 
gene of interest is spaced from a start codon of said 
selectable marker gene at a distance which is 
sufficient to ensure that said selectable marker 
protein is expressed from the corresponding mRNA as a 
result of translation reinitiation. 

2. A recombinant expression vector according to claim 1 
wherein the vector is a viral vector. 

3. A recombinant expression vector according to claim 2 
wherein the vector is a retroviral vector. 

4. A recombinant expression vector according to any one 
of claims 1 to 3 wherein the gene of interest is 
included as part of a viral packaging construct. 

5. A recombinant expression vector according to any one 
of the preceding claims wherein the number of 
nucleotides in the space between the stop codon of the 
gene of interest and the start codon of the selectable 
marker is in the range of from 20 to 200 nucleotides. 

6. A recombinant expression vector according to claim 5 
wherein the number of nucleotides in the space between 
the stop codon of the gene of interest and the start 
codon of the selectable marker is in the range of from 
60 to 80 nucleotides. 

7 . A process for producing a cell line in which a gene of 
interest is expressed, which process comprises: 

transforming host cells with an expression vector 
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according to any one of the claims 1 to 6; and 

selectable those cells where expression of the 
selection marker gene may be detected. 

8. A process according to claim 7 wherein the host cell 
is a eukaryotic cell. 

9. A host cell transformed with a recombinant expression 
vector according to any one of the claims 1 to 6 . 

10. A retroviral packaging cell line comprising a host 
cell transformed with a first and a second recombinant 
expression vector, said first recombinant expression 
vector having a packaging- deficient construct 
comprising a viral gag-pol gene and a first selectable 
marker gene downstream thereof, and said second 
recombinant, expression vector having a packaging- 
deficient construct comprising a viral env gene and a 
second selectable marker gene downstream thereof; 
wherein the start codon of the first and second 
selectable markers are spaced from the stop codons of 
the viral gag-pol gene and the viral env gene 
respectively by a distance which ensures that said 
selectable marker protein is expressed from the 
corresponding mRNA as a result of translation 
reinitiation. 

11. A retroviral packaging cell line according to claim 10 
wherein the first selectable marker is a bsr 
selectable marker and the second selectable marker is 
a phleo selectable marker. 

12. A retroviral packaging cell line according to any one 
of claims 10 or 11 wherein the packaging-deficient 
construct comprising the viral gag-pol gene and first 
selectable marker is the CeB (SEQ ID No 2) expression 
construct . 
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13 . A retroviral packaging cell line according to any one 
of claims 10 or 11 wherein the packaging-deficient 
construct comprising the viral env gene and second 
selectable marker is the FBdelPASAF (SEQ ID No 5), 
the FBdelPMOSAF (SEQ ID No 6) , the FbdelPGASAF (SEQ ID 
No 7), the FbdelPRDSAF (SEQ ID No 8), the FbdelPXSAF 
(Fig. 3), the FbdelPlOAlSAF (Fig. 3), or the 
FBde 1 PVS VGSAF (Fig. 3) expression construct. 

14. A retroviral packaging cell line according to any one 
of claims 10 or 11 wherein the recombinant expression 
vector is a packaging-deficient retroviral helper 
construct. 

15. A retroviral packaging cell line according to claim 14 
wherein the overlapping sequences between the genomes 
of the retroviral vector and the packaging-deficient 
construct is reduced by minimizing the extent of non- 
coding retroviral sequences in the packaging-deficient 
genome . 

16. A retroviral packaging cell line according to any one 
of claims 10 to 15 wherein the viral gag-pol gene and 
the selectable marker are expressed under the control 
of a non- retroviral promoter. 

17. A retroviral packaging cell line according to claim 16 
wherein the promoter is fused to rabbit beta-1 globin 
intron . 

18. A retroviral packaging cell line according to claim 16 
or claim 17 wherein the promoter is a hCMV promoter. 

19. A retroviral packaging cell line according to any one 
of claims 16 to claim 18 wherein the viral gag-pol 
gene and the selectable marker is a hCMV+intron (SEQ 
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ID No3) or a hCMV+intronkaSD (SEQ ID No 4) expression 
construct . 

20. A retroviral packaging cell line according to anyone 
of claims 10 to 15 wherein the viral env gene and the 
selectable marker are under the control of a non- 
retroviral promoter. 

21. A retroviral packaging cell line according to claim 20 
wherein the promoter is fused to rabbit beta-1 globin 
intron. 



22. A retroviral packaging cell line according to claim 20 
or claim 21 wherein the promoter is a hCMV promoter. 

23 . A retroviral packaging cell line according any one of 
claims 20 to 22 wherein the viral env gene and the 
selectable marker is a CMV10A1 (SEQ ID No 3) 
expression construct. 



24. A retroviral packaging cell line according to any one 
of claims 10 to 23 wherein the cell line is the HT1080 
line, the TE671 line/ the 3T3 line, the 293 line or 
the MV-1-1U line. 

25. A retroviral packaging cell line according to anyone 
of claims 10 to 24 wherein the retroviral packaging 
cells comprises human HT1080 cells and express RD114 
envelopes . 

26. A retroviral packaging cell line according to anyone 
of claims 10 to 24 wherein the retroviral packaging 
cells comprises human TE671 cells and express RD114 
envelopes . 
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27. A process for producing a retroviral packaging cell 
line in which a gene of interest in expressed, which 
process comprises: 

transforming host cells with a first and a second 
recombinant expression vector, said first recombinant 
expression vector having a packaging-deficient 
construct comprising a viral gag-pol gene and a first 
selectable marker gene downstream thereof, and said 
second recombinant expression vector having a 
packaging-deficient construct comprising a viral env 
gene and a second selectable marker gene downstream 
thereof; wherein the start codon of the first and 
second selectable markers are spaced from the stop 
codons of the viral gag-pol gene and the viral env 
gene respectively by a distance which ensures that 
said selectable marker protein is expressed from the 
corresponding mRNA as a result of translation 
reinitiation; and 

selecting transformed cells which express said 
first and/or second marker genes. 

28. A packaging deficient construct for use in a process 
according to claim 27, which expresses a viral gag-pol 
gene and a selectable marker wherein a start codon of 
the selectable marker is spaced from a stop codon of 
the viral gag-pol gene by a distance which ensures 
that said selectable marker protein is expressed from 
the corresponding mRNA as a result of translation 
reinitiation. 

29. A packaging deficient construct for use in a process 
according to claim 27, which expresses a viral env 
gene and a selectable marker gene; wherein a start 
codon of the selectable marker is spaced from a stop 
codon of the viral env gene by a distance which 
ensures that said selectable marker protein is 
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expressed from the corresponding mRNA as a result of 
translation reinitiation. 
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v de! psi 



MLV-LTR 




JSD 



Mo MLV GAG-POL 



V T 



j 

SA 



BSR 



j- 



poly A 



pol ge/7g.. TCTAGA CTG ACA T3 G CG'C 
GTT CAA CGC TCT CAA AAC CCC IT A 
AAA ATA AGG TTA ACC CGC GAG GCC 
CCC TAA 

tccccttaattcttctcatgctcagaggggtcagtac 
tGCttcQcccQCCtccaotQc agcccaoccQQ ccacc 
AT® AAA ACA TTT AAC ATT TCT... bsr gene 



Figure 1. Schematic structure of CeB expression vector 
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FB29 LTR 



H 

JSD 



SA 



ENV 



1r 



7 * 


f 


- PHLEO 


poly A 





ew gene.. J AC GAG CCA TAG 
QQcqcc tagtgttgacaattaatcatcggcatagtata 
cggcatagtataatacgactcactataggagggccacc 
ATBG CC AAG TTG ACC.phleo gene 



Figure 2. Schematic structure of FBdelPASF expression vector 
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Kasl 

U r 



sc3cRDl 14 env onieo 



poiyA 



Mscl/Ndel 
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;<asl 



I 
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Msci/BamHl 



FBdelPVSVGSAF Kasl 



F329 LTR 



_2 yU ▼ 

H vsv-G KHoF»eo> 



polyA 



Msd/Bgill 



CMV10A1SAF 


Oa 

V 
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J. . 


polyA 


i 


rr-n — " uv ~ ,v — 

so sa| 

SamHl/Eagl 
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Figure 3. Schematic structure of env expression vectors 
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NGAGCTC AGGACAGGTAGAAAGAATGAATAGAACAATAAAAGAGACCCTTACTAAATTGA 6 0 
CCTTAGAGACTGGCTTAAAAGATTGGAGACGCCTCCT ATCTCTGGCTTTGTTAAGAGCCA 120 
G AAATACGCCCAACCGTTTTCGGCTC ACCCCATATGAAATCCTTTATGGGGGACCCCCCC 180 
CTTTGTC AACCTTGCTCAATTCCTTCTCCCCCTCCGATCCTAAGACTGATTTACAAGCCC 240 
GACTAAAAGGGCTGCAAGGCGTGCAGGCCCAAATCTGGACACCCCTGGCCGAATTGTACC 3 00 
GGCCAGGACATCCACAAACTAGCCACCCATTTCAGGTGGGAGACTCCGTGTACGTCCGGC 3 60 
GGC ACCGCTCTCAAGGATTGGAGCCTCGTTGGAAGGGACCTTAC ATCGTCCTGCTGACCA 420 
CGCCCACCGCC ATAAAGGTTGACGGGATCGCCGCCTGGATTCACGCATCGCACGGC AAGG 480 
CAGCCCCAAAAACCCCTGGACCAGAAACTCCCAAAACCTGGAAGCTCCGCCGTTCGGAGA 540 
ACCCTCTTAAGATAAGACTCTCCCGTGTCTGACTGCTAATCCACCTTGTCCCTGTACTAA 600 
CCC AAAATGAAACTCCC AAC AGGAATGGTCATTTTATGT AGCCT AATAAT AGTTCGGGC A 660 
GGGTTTGACGACCCCCGCAAGGCTATCGCATTAGTACAAAAACAACATGGTAAACC ATGC 720 
GAATGCAGCGGAGGGCAGGTATCCG AGGCCCCACCGAACTCCATCCAACAGGTAACTTGC 7 30 
CC AGGCAAGACGGCCTACTT AATGACCAACC AAAAATGGAAATGCAGAGTCACTCC AAAA 840 
ATCTC ACCTAGCGGGGGAGAACTCC AGAACTGCCCCTGT AACACTTTCCAGGACTCGATG 900 
CACAGTTCTTGTTATACTGAATACCGGCAATGCAGGCGAATTAATAAGACATACTACACG 960 
GCCACCTTGCTTAAAATACGGTCTGGGAGCCTCAACGAGGTACAGATATT ACAAAACCCC 1020 
AATCAGCTCCTAC AGTCCCCTTGT AGGGGCTCTATAAATC AGCCCGTTTGCTGGAGTGCC 1080 
ACAGCCCCCATCCATATCTCCGATGGTGGAGGACCCCTCGATACTAAGAGAGTGTGGACA 1140 
GTCC AAAAAAGGCTAGAACAAATTC ATAAGGCTATGACTCCTGAACTTC AATACC ACCCC 1200 
TTAGCCCTGCCC AAAGTC AGAGATGACCTTAGCCTTGATGC ACGGACTTTTGATATCCTG 1260 
AATACC ACTTTTAGGTTACTCC AGATGTCCAATTTTAGCCTTGCCCAAGATTGTTGGCTC 13 20 
TGTTTAAAACTAGGTACCCCTACCCCTCTTGCGATACCCACTCCCTCTTTAACCTACTCC 1380 
CTAGCAGACTCCCTAGCGAATGCCTCCTGTCAGATTATACCTCCCCTCTTGGTTCAACCG 1440 
ATGCAGTTCTCCAACTCGTCCTGTTTATCTTCCCCTTTCATTAACGATACGGAACAAATA 1500 
GACTTAGGTGCAGTCACCTTTACTAACTGCACCTCTGTAGCCAATGTCAGTAGTCCTTTA 1560 
TGTGCCCT AAACGGGTC AGTCTTCCTCTGTGGAAAT AACATGGC ATACACCTATTT ACCC 1620 
CAAAACTGGACCAGACTTTGCGTCC AAGCCTCCCTCCTCCCCGACATTGACATCAACCCG 1680 
GGGGATG AGCC AGTCCCCATTCCTGCC ATTGATCATTATATACATAGACCTAAACG AGCT 1740" 
GTAC AGTTCATCCCTTTACTAGCTGGACTGGGAATCACCGCAGCATTCACCACCGGAGCT 1800 
AC AGGCCT AGGTGTCTCCGTC ACCC AGT ATACAAAATT ATCCCATCAGTTAATATCTGAT 1860 
GTCCAAGTCTTATCCGGTACC ATACAAGATTTACAAGACCAGGT AGACTCGTTAGCTGAA 1920 
GTAGTTCTCCAAAATAGGAGGGGACTGGACCTACTAACGGCAGAACAAGGAGGAATTTGT 1980 
TTAGCCTTAC AAG AAAAATGCTGTTTTTATGCTAACAAGTCAGGAATTGTGAGAAACAAA 2040 
ATAAGAACCCTACAAGAAGAATTACAAAAACGCAGGGAAAGCCTGGCAACCAACCCTCTC 2 100 
TGGACCGGGCTGCAGGGCTTTCTTCCGTACCTCCTACCTCTCCTGGGACCCCTACTCACC 2160 
CTCCTACTC ATACTAACCATTGGGCCATGCGTTTTCAGTCGCCTCATGGCCTTCATTAAT 2220 
GATAG ACTTAATGTTGTAC ATGCC ATGGTGCTGGCCCAGCAATACCAAGC ACTCAAAGCT 2280 
GAGGAAGAAGCTC AGGATTGAGCTTCCGGGACAAAAGC AGGGGGGAATGAGAAGTC AGAA 2340 
CCCCCC ACCTTTGCT ACAT AAATAACCGCTTTC ATTTCGCTTCTGTAAAACGCTTATGCG 2 400 
CCCCACCCTAGCCGGAAAGTCCCCAGCCGCTACGCAACCCGGGCCCCGAGTTGCATCAGC 2460 
CGTTCGC AACCCGGGCTCCGAGTTGCATC AGCCGAAAG AAACTTC ATTTCCCAAGCTT 2518 



Fig. 4 
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AATGAAAGAC CCCACCTGTA GGTTTGGCAA GCTAGCTTAA GTAACGCCAT TTTGCAAGGC - 60 

ATGGAAAAAT ACATAACTGA GAATAGAGAA GTTCAGATCA AGGTCAGGAA CAGATGGAAC 120 

AGCTGAATAT GGGCCAAACA GGATATCTGT GGTAAGCAGT TCCTGCCCCG GCTCAGGGCC 180 

AAGAACAGAT GGAACAGCTG AATATGGGCC AAACAGGATA TCTGTGGTAA GCAGTTCCTG 240 

CCCCGGCTCA GGGCCAAGAA CAGATGGTCC CCAGATGCGG TCCAGCCCTC AGCAGTTTCT 300 

AGAGAACCAT CAGATGTTTC CAGGGTGCCC CAAGGACCTG AAATGACCCT GTGCCTTATT 360 

TGAACTAACC AATCAGTTCG CTTCTCGCTT CTGTTCGCGC GCTTCTGCTC CCCGAGCTCA 420 

ATAAAAGAGC CCACAACCCC TCACTCGGGG CGCCAGTCCT CCGATTGACT GAGTCGCCCG 480 

GGTACCCGTG TATCCAATAA ACCCTCTTGC AGTTGCATCC GACTTGTGGT CTCGCTGTTC 540 

CTTGGGAGGG TCTCCTCTGA GTGATTGACT ACCCGTCAGC GGGGGTCTTT CATTTGGGGG 600 

CTCGTCCGGG ATCGGGAGAC CCCTGCCCAG GGACCACCGA CCCACCACCG GGAGGTAAGC 660 

TGGAAGCTTC TGCAGCATCG TTCTGTGTTG TCTCTGTCTG ACTGTGTTTC TGTATTTGTC 720 

TGAGAATATG GGCCAGACTG TTACCACTCC CTTAAGTTTG ACCTTAGGTC ACTGGAAAGA 780 

TGTCGAGCGG ATCGCTCACA ACCAGTCGGT AGATGTCAAG AAGAGACGTT GGGTTACCTT 840 

CTGCTCTGCA GAATGGCCAA CCTTTAACGT CGGATGGCCG CGAGACGGCA CCTTTAACCG 900 

AGACCTCATC ACCCAGGTTA AGATCAAGGT CTTTTCACCT GGCCCGCATG GACACCCAGA 960 

CCAGGTCCCC TACATCGTGA CCTGGGAAGC CTTGGCTTTT GACCCCCCTC . CCTGGGTCAA 1020 

GCCCTTTGTA CACCCTAAGC CTCCGCCTCC TCTTCCTCCA TCCGCCCCGT CTCTCCCCCT 1080 

TGAACCTCCT CGTTCGACCC CGCCTCGATC CTCCCTTTAT CCAGCCCTCA CTCCTTCTCT 1140 

AGGCGCCAAA CCTAAACCTC AAGTTCTTTC TGACAGTGGG GGGCCGCTCA TCGACCTACT 1200 

TACAGAAGAC CCCCCGCCTT ATAGGGACCC AAGACCACCC CCTTCCGACA GGGACGGAAA 1260 

TGGTGGAGAA GCGACCCCTG CGGGAGAGGC ACCGGACCCC TCCCCAATGG CATCTCGCCT 1320 

ACGTGGGAGA CGGGAGCCCC CTGTGGCCGA CTCCACTACC TCGCAGGCAT TCCCCCTCCG 1380 

CGCAGGAGGA AACGGACAGC TTCAATACTG GCCGTTCTCC TCTTCTGACC TTTACAACTG 1440 

GAAAAATAAT AACCCTTCTT TTTCTGAAGA TCCAGGTAAA CTGACAGCTC TGATCGAGTC 1500 

TGTTCTCATC ACCCATCAGC CCACCTGGGA CGACTGTCAG CAGCTGTTGG GGACTCTGCT 1560 

GACCGGAGAA GAAAAACAAC GGGTGCTCTT AGAGGCTAGA AAGGCGGTGC GGGGCGATGA 1620 

TGGGCGCCCC ACTCAACTGC CCAATGAAGT CGATGCCGCT TTTCCCCTCG AGCGCCCAGA 1680 

CTGGGATTAC ACCACCCAGG CAGGTAGGAA CCACCTAGTC CACTATCGCC AGTTGCTCCT 1740 

AGCGGGTCTC CAAAACGCGG GCAGAAGCCC CACCAATTTG GCCAAGGTAA AAGGAATAAC 1800 

ACAAGGGCCC AATGAGTCTC CCTCGGCCTT CCTAGAGAGA CTTAAGGAAG CCTATCGCAG 18.60 

GTACACTCCT TATGACCCTG AGGACCCAGG GCAAGAAACT AATGTGTCTA TGTCTTTCAT 1920 

TTGGCAGTCT GCCCCAGACA TTGGGAGAAA GTTAGAGAGG TTAGAAGATT TAAAAAACAA 1980 

GACGCTTGGA GATTTGGTTA GAGAGGCAGA AAAGATCTTT AATAAACGAG AAACCCCGGA 2040 

AGAAAGAGAG GAACGTATCA GGAGAGAAAC AGAGGAAAAA GAAGAACGCC GTAGGACAGA 2100 

GGATGAGCAG AAAGAGAAAG AAAGAGATCG TAGGAGACAT AGAGAGATGA GCAAGCTATT 2160 

GGCCACTGTC GTTAGTGGAC AGAAACAGGA TAGACAGGGA GGAGAACGAA GGAGGTCCCA 2220 

ACTCGATCGC GACCAGTGTG CCTACTGCAA AGAAAAGGGG CACTQGGCTA AAGATTGTCC 2280 

CAAGAAACCA CGAGGACCTC GGGGACCAAG ACCCCAGACC TCCCTCCTGA CCCTAGATGA 2340 

CTAGGGAGGT CAGGGTCAGG AGCCCCCCCC TGAACCCAGG ATAACCCTCA AAGTCGGGGG 2400 

GCAACCCGTC ACCTTCCTGG TAGATACTGG GGCCCAACAC TCCGTGCTGA CCCAAAATCC 2460 

TGGACCCCTA AGTGATAAGT CTGCCTGGGT CCAAGGGGCT ACTGGAGGAA AGCGGTATCG 2520 

CTGGACCACG GATCGCAAAG TACATCTAGC TACCGGTAAG GTCACCCACT CTTTCCTCCA 2580 

TGTACCAGAC TGTCCCTATC CTCTGTTAGG AAGAGATTTG CTGACTAAAC TAAAAGCCCA 2640 

AATCCACTTT GAGGGATCAG GAGCTCAGGT TATGGGACCA ATGGGGCAGC CCCTGCAAGT 2700 

GTTGACCCTA AATATAGAAG ATGAGCATCG GCTACATGAG ACCTCAAAAG AGCCAGATGT 2760 

TTCTCTAGGG TCCACATGGC TGTCTGATTT TCCTCAGGCC TGGGCGGAAA CCGGGGGCAT 2820 

GGGACTGGCA GTTCGCCAAG CTCCTCTGAT CATACCTCTG AAAGCAACCT CTACCCCCGT 2880 

GTCCATAAAA CAATACCCCA TGTCACAAGA AGCCAGACTG GGGATCAAGC " CCCACATACA 2940 

GAGACTGTTG GACCAGGGAA TACTGGTACC CTGCCAGTCC CCCTGGAACA CGCCCCTGCT 3000 

ACCCGTTAAG AAACCAGGGA CTAATGATTA TAGGCCTGTC CAGGATCTGA GAGAAGTCAA 3060 

CAAGCGGGTG GAAGACATCC ACCCCACCGT GCCCAACCCT TACAACCTCT TGAGCGGGCT 3120 
CCCACCGTCC CACCAGTGGT ACACTGTGCT TGATTTAAAG GATGCCTTTT TCTGCCTGAG . 3180 
ACTCCACCCC ACCAGTCAGC CTCTCTTCGC CTTTGAGTGG AGAGATCCAG AGATGGGAAT ' 3240 
CTCAGGACAA TTGACCTGGA CCAGACTCCC ACAGGGTTTC AAAAACAGTC CCACCCTGTT . 3300 

TGATGAGGCA CTGCACAGAG ACCTAGCAGA CTTCCGGATC CAGCACCCAG ACTTGATCCT 3360 

GCTACAGTAC GTGGATGACT TACTGCTGGC CGCCACTTCT GAGCTAGACT GCCAACAAGG 3420 

TACTGGGGCC CTGTTACAAA CCCTAGGGAA CCTCGGGTAT CGGGCCTCGG CCAAGAAAGC 3480 

CCAAATTTGC CAGAAACAGG TCAAGTATCT GGGGTATCTT CTAAAAGAGG GTCAGAGATG 3 540 

GCTGACTGAG GCCAGAAAAG AGACTGTGAT GGGGCAGCCT ACTCCGAAGA CCCCTCGACA 3 600 

ACTAAGGGAG TTCCTAGGGA CGGCAGGCTT CTGTCGCCTC TGGATCCCTG GGTTTGCAGA 3660 

AATGGCAGCC CCCTTGTACC CTCTCACCAA AACGGGGACT CTGTTTAATT GGGGCCCAGA 3720 

CCAACAAAAG GCCTATCAAG AAATCAAGCA AGCTCTTCTA ACTGCCCCAG CCCTGGGGTT 3780 

GCCAGATTTG ACTAAGCCCT TTGAACTCTT TGTCGACGAG AAGCAGGGCT ACGCCAAAGG 3840 

TGTCCTAACG CAAAAACTGG GACCTTGGCG TCGGCCGGTG GCCTACCTGT CCAAAAAGCT 3900 

AGACCCAGTA GCAGCTGGGT GGCCCCCTTG CCTACGGATG GTAGCAGCCA TTGCCGTACT 3960 

GACAAAGGAT GCAGGCAAGC TAACCATGGG ACAGCCACTA GTCATTCTGG CCCCCCATGC 4020 

AGTAGAGGCA CTAGTCAAAC AACCCCCCGA CCGCTGGCTT TCCAACGCCC GGATGACTCA 4080 
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CTATCAGGCC 
CCCGGCTACG 
GGCCGAAGCC 
CACCTGGTAC 
GGTGACCACC 
GCGGGCTGAA 
TGTTTATACT 
AAGGCGTGGG 
CCTACTAAAA 
AAAGGGACAC 
AGCCATCACA 
CTCAGAACAT 
TTATGATAAA 
TACTTTTGAA 
GGCTCTCCTA 
AAATATCACT 
ACAGGGAACT 
GATAAAGCCC 
CTGGATAGAA 
AGAGGAGATC 
CTTCGTCTCC 
TTGTGCATAC 
GACTTTAACT 
AGCCCTGTAC 
ATATGGGGCA 
CAGCCCCTCT 
ACCTCTGGCG 
AGTCGGCGAC 
AGGACCTTAC 
TTGGATACAC 
GACATGGCGC 
CTAATCCCCT 
GCGGCCCAGC 
GAAGTAGCGA 
GCAATTCGTA 
CGAGTAACTG 
GATTTTGACA 
CGAGTGGTAA 
TTTGTGTTAA 
CTCAAATATA 
AAAGACAGGA 
CCTATAGAGT 
GGAATGAAAG 
GCATGGAAAA 
ACAGTCGAGA 
AATTTCACAA 
AATGTATCTT 
CTCCTCTACT 
GTTAATTAGG 
AAGGGTAATT 
AAACAGCCCA 
AACACCCTGC 
CACATTTTCC 
AGGAACCCAG 
GTTCATCTGC 
GGTCCTGTAG 
ATGAAAATTT 
TGAATGCAAG 
CCACATCAAA 



TTGCTTTTGG 
CTGCTCCCAC 
CACGGAACCC 
ACGGATGGAA 
GAGACCGAGG 
CTGATAGCAC 
GATAGCCGTT 
TTGCTCACAT 
GCCCTCTTTC 
AGCGCCGAGG 
GAGACTCCAG 
TTTCATTACA 
ACAAAGAAGT 
TTATTAGACT 
GAGAGAAGCC 
GAGACCTGCA 
AGGGTCCGCG 
GGATTGTATG 
GCCTTCCCAA 
TTCCCCAGGT 
AAGGTGAGTC 
AGACCCCAAA 
AAATTAACGC 
CGAGCCCGCA 
CCCCCGCCCC 
CTCCAAGCTC 
GCAGCCTACC 
ACAGTGTGGG 
ACAGTCCTGC 
GCCGCCCACG 
GTTCAACGCT 
TAATTCTTCT 
CGGCCACCAT 
CAGAGAAGAT 
CGAAAACAGG 
TTTGTGCAGA 
CGATTGTAGC 
GTCCTTGTGG 
TAGAAATGAA 
CCCGAAATTA 
TATCAGTGGT 
ACGAGCCATA 
ACCCCACCTG 
ATACATAACT 
ACTTGTTTAT 
ATAAAGCATT 
ATCATGTCTG 
TGAGAGGACA 
TCACTTAACA 
TTAAAATATC 
CAAATGTCAA 
TCATCAAGAA 
CCACCTGTGT 
CACTCCACTG 
TGACTGTCAA 
TTTGCTAACA 
GACCCTTGAA 
TTTAACATAG 
ATATTTCCAC 



ACACGGACCG 
TGCCTGAGGA 
GACCCGACCT 
GCAGTCTCTT 
TAATCTGGGC 
TCACCCAGGC 
ATGCTTTTGC 
CAGAAGGCAA 
TGCCCAAAAG 
CTAGAGGCAA 
ACACCTCTAC 
CAGTGACTGA 
ATTGGGTCTA 
TTCTTCATCA 
ACAGTCCCTA 
AAGCTTGTGC 
GGCATCGGCC 
GCTATAAATA 
CCAAGAAAGA 
TCGGCATGCC 
AGACAGTGGC 
GCTCAGGCCA 
TTGCAACTGG 
ACACGCCGGG 
TTGTAAACTT 
ACTTACAGGC 
AAGAACAACT 
TCCGCCGACA 
TGACCACCCC 
TGAAGGCTGC 
CTCAAAACCC 
GATGCTCAGA 
GAAAACATTT 
TACAATGCTT 
AGAAATCATT 
AGCCATTGCG 
TGTTAGACAC 
TATGTGTAGG 
TGGCAAGTTA 
AAAGTTTTAC 
CCAGGCTCTA 
GATAAAATAA 
TAGGTTTGGC 
GAGAATAGAG 
TGCAGCTTAT 
TTTTTCACTG 
GATCCCCAGG 
TTCCAATCAT 
AAAAGGAAAT 
TGGGAAGTCC 
CAGCAGAAAC 
GCACTGTGGT 
AGGTTCCAAA 
GATAAGCATT 
CTGTAGCATT 
CACCCTGCAG 
TGGGTTTTCC 
CAGTTACCCC 
AGGTTAAGTC 



9/22 

GGTCCAGTTC 
AGGGCTGCAA 
AACGGACCAG 
ACAAGAGGGA 
TAAAGCCCTG 
CCTAAAGATG 
TACTGCCCAT 
AGAGATCAAA 
ACTTAGCATA 
CCGGATGGCT 
CCTCCTCATA 
TATAAAGGAC 
CCAAGGAAAA 
GCTGACTCAC 
CTACATGCTG 
ACAAGTCAAC 
CGGCACTCAT 
TCTTCTAGTT 
AACCGCCAAG 
TCAGGTATTG 
CGATCTGTTG 
GGTAGAAAGA 
CTCTAGAGAC 
CCCCCATGGC 
CCCTGACCCT 
TCTCTACTTA 
GGACCGACCG 
CCAGACTAAG 
CACCGCCCTC 
CGACCCCGGG 
CTTAAAAATA 
GGGGTCAGTA 
AACATTTCTC 
TATGAGGATA 
TCGGCAGTAC 
ATTGGTAGTG 
CCTTATTCTG 
GAGTTGATTT 
GTCAAAACTA 
CACCAAGCTT 
GTTTTGACTC 
AAGATTTTAT 
AAGCTAGCTT 
AAGTTCAGAT 
AATGGTTACA 
CATTCTAGTT 
AAGCTCCTCT 
AGGCTGCCCA 
TGGGTAGGGG 
CTTCCACTGC 
ATACAAGCTG 
TGCTGTGTTA 
ATATCTAGTG 
ATCCTTATCC 
TTTTGGGGTT 
CTCCAAAGGT 
AGCACCATTT 
AATAACCTCA 
CTCATTTAAA 



GGACCGGTGG 
GACAACTGCC 
CCGCTCCCAG 
CAGCGTAAGG 
CCAGCCGGGA 
GCAGAAGGTA 
ATCCATGGAG 
AATAAAGACG 
ATCCATTGTC 
GACCAAGCGG 
GAAAATTCAT 
CTAACCAAGT 
CCTGTGATGC 
CtCAGCTTCT 
AACCGGGATC 
GCCAGCAAGT 
TGGGAGATCG 
TTTATAGATA 
GTCGTAACCA 
GGAACTGACA 
GGGATTGATT 
ATGAATAGAA 
TGGGTGCTCC 
CTCACCCCAT 
GACATGACAA 
GTCCAGCACG 
GTGGTACCTC 
AACCTAGAAC 
AAAGTAGACG 
GGTGGACCAT 
AGGTTAACCC 
CTGCTTCGCC 
AACAAGATCT 
ATAAACATCA 
ATATTGAAGC 
CAGTTTCGAA 
ACGAAGTAGA 
CAGACTATGC 
CGATTGAAGA 
ATCGATTAGT 
AACAATATCA 
TTAGTCTCCA 
AAGTAACGCC 
CAAGGTCAGG 
AATAAAGCAA 



GTGTCCTCAT 
TCCACCCTCT 
TTTTTCACAG 
TGTGTTCCAG 
TCAGCTTTGC 
GTAATGTGCA 
TTTTCATTTT 
AAAACAGCCT 
ACAGTTTGAG 
TCCCCACCAA 
TCATGAGTTT 
GTTTTAACAG 
TTAGGCAAAG 



TAGCCCTGAA 

TTGATATCCT 

ACGCCGACCA 

CGGGAGCTGC 

CATCCGCTCA 

AGAAGCTAAA 

AAATATACAG 

AGATCTTGGC 

CAGGACATCA 

CCCGAAAGGC 

CACCCTACAC 

TGGGGGCCAT 

CTGACCAGTT 

CAAAAATGAA 

GAACACTCAA 

CTGCCGTTAA 

ATTTCACCGA 

CCTTTTCTGG 

AGAAGCTACT 

ATGGGCCTGC " 

GGAAATTACA . 

CCATCAAGGA 

TACTCCCCTT 

ATGAGATCTT 

GAGTTACTAA 

AAGTCTGGAG 

ACCCTTACCG 

CTCGCTGGAA 

GCATCGCAGC 

CCTCTAGACT 

GCGAGGCCCC 

CGGCTCCAGT 

AGAATTAGTA 

TGTGGGAGCG 

GTATATAGGA 

TGGACAAAAG 

TAGAAGTATT 

ACCAGATTGT 

ACTCATTCCA 

CCAATTTGTT 

CCAGCTGAAG 

GAAAAAGGGG 
ATTTTGCAAG 

AACAGATGGA 
TAGCATCACA 
CAAACTCATC 
AAACCCTAAC 
GTGTCCTCCT 
ACCGCTTTCT 
AAGTGTTGGT 
ACAAGGGCCC 
AAACAGGAGG 
TACTTGGATC 
TGTGGTCAGT 
CAGGATATTT 
CAGCAAAAAA 
TTTGTGTCCC 
TAACAGCTTC 
GAATTC 
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AGATCTCCCG 
AGCCAGTATC 
TAAGCTACAA 
CGTTTTGCGC 
AGTTATTAAT 
GTTACATAAC 
ACGTCAATAA 
TGGGTGGACT 
AGTACGCCCC 
ATGACCTTAT 
ATGGTGATGC 
TTTCCAAGTC 
GACTTTCCAA 
CGGTGGGAGG 
GCTTATCGAA 
CTTTTTCGCT 
TAGAATGGGA 
CTTTCTACTC 
TTCGTTAAAC 
AGATTGTAAG 
TACTTCAGCA 
GCATATAAAT 
CATCATCCTG 
ATACTCTGAG 
ACAGCTCCTG 
GCAAGCTTCT 
GAGAATATGG 
GTCGAGCGGA 
TGCTCTGCAG 
GACCTCATCA 
CAGGTCCCCT 
CCCTTTGTAC 
GAACCTCCTC 
GGCGCCAAAC 
ACAGAAGACC 
GGTGGAGAAG 
CGTGGGAGAC 
GCAGGAGGAA 
AAAAATAATA 
GTTCTCATCA 
ACCGGAGAAG 
GGGCGCCCCA 
TGGGATTACA 
GCGGGTCTCC 
CAAGGGCCCA 
TACACTCCTT 
TGGCAGTCTG 
ACGCTTGGAG 
GAAAGAGAGG 
GATGAGCAGA 
GCCACTGTCG 
CTCGATCGCG 
AAGAAACCAC 
TAGGGAGGTC 
CAACCCGTCA 
GGACCCCTAA 
TGGACCACGG 
GTACCAGACT 
ATCCACTTTG 
TTGACCCTAA 
TCTCTAGGGT 
GGACTGGCAG 
TCCATAAAAC 
AGACTGTTGG 
CCCGTTAAGA 
AAGCGGGTGG 
CCACCGTCCC 
CTCCACCCCA 



ATCCCCTATG 
TGCTCCCTGC 
CAAGGCAAGG 
TGCTTCGCGA 
AGTAATCAAT 
TTACGGTAAA 
TGACGTATGT 
ATTTACGGTA 
CTATTGACGT 
GGGACTTTCC 
GGTTTTGGCA 
TCCACCCCAT 
AATGTCGTAA 
TCTATATAAG 
ATGTCGACTG 
ATTGTAAAAT 
AGATGTCCCT 
TGTTGACAAC 
TTTAGCTTGC 
TACTTTCTCT 
CAGTTTTAGA 
TCTGGCTGGC 
CCTTTCTCTT 
TCCAAACCGG 
GGCAACGTGC 
GCAGCATCGT 
GCCAGACTGT 
TCGCTCACAA 
AATGGCCAAC 
CCCAGGTTAA 
ACATCGTGAC 
ACCCTAAGCC 
GTTCGACCCC 
CTAAACCTCA 
CCCCGCCTTA 
CGACCCCTGC 
GGGAGCCCCC 
ACGGACAGCT 
ACCCTTCTTT 
CCCATCAGCC 
AAAAACAACG 
CTCAACTGCC 
CCACCCAGGC 
AAAACGCGGG 
ATGAGTCTCC 
ATGACCCTGA 
CCCCAGACAT 
ATTTGGTTAG 
AACGTATCAG 
AAGAGAAAGA 
TTAGTGGACA 
ACCAGTGTGC 
GAGGACCTCG 
AGGGTCAGGA 
CCTTCCTGGT 
GTGATAAGTC 
ATCGCAAAGT 
GTCCCTATCC 
AGGGATCAGG 
ATATAGAAGA 
CCACATGGCT 
TTCGCCAAGC 
AATACCCCAT 
ACCAGGGAAT 
AACCAGGGAC 
AAGACATCCA 
ACCAGTGGTA 
CCAGTCAGCC 



GTCGACTCTC 
TTGTGTGTTG 
CTTGACCGAC 
TGTACGGGCC 
TACGGGGTCA 
TGGCCCGCCT 
TCCCATAGTA 
AACTGCCCAC 
CAATGACGGT 
TACTTGGCAG 
GTACATCAAT 
TGACGTCAAT 
CAACTCCGCC 
CAGAGCTCTC 
AGAACTTCAG 
TCATGTTATA 
TGTATCACCA 
CATTGTCTCC 
ATTTGTAACG 
AATCACTTTT 
GAACAATTGT 
GTGGAAATAT 
TATGGTTACA 
GCCCCTCTGC 
TGGTTGTTGT 
TCTGTGTTGT 
TACCACTCCC 
CCAGTCGGTA 
CTTTAACGTC 
GATCAAGGTC 
CTGGGAAGCC 
TCCGCCTCCT 
GCCTCGATCC 
AGTTCTTTCT 
TAGGGACCCA 
GGGAGAGGCA 
TGTGGCCGAC 
TCAATACTGG 
TTCTGAAGAT 
CACCTGGGAC 
GGTGCTCTTA 
CAATGAAGTC 
AGGTAGGAAC 
CAGAAGCCCC 
CTCGGCCTTC 
GGACCCAGGG 
TGGGAGAAAG 
AGAGGCAGAA 
GAGAGAAACA 
AAGAGATCGT 
GAAACAGGAT 
CTACTGCAAA 
GGGACCAAGA 
GCCCCCCCCT 
AGATACTGGG 
TGCCTGGGTC 
ACATCTAGCT 
TCTGTTAGGA 
AGCTCAGGTT 
TGAGCATCGG 
GTCTGATTTT 
TCCTCTGATC 
GTCACAAGAA 
ACTGGTACCC 
TAATGATTAT 
CCCCACCGTG 
CACTGTGCTT 
TCTCTTCGCC 



AGTACAATCT 
GAGGTCGCTG 
AATTGCATGA 
AGATATACGC 
TTAGTTCATA 
GGCTGACCGC 
ACGCCAATAG 
TTGGCAGTAC 
AAATGGCCCG 
TACATCTACG 
GGGCGTGGAT 
GGGAGTTTGT 
CCATTGACGC 
TGGCTAACTA 
GGTGAGTTTG 
TGGAGGGGGC 
TGGACCCTCA 
TCTTATTTTC 
AATTTTTAAA 
TTTTCAAGGC 
TATAATTAAA 
TCTTATTGGT 
ATGATATACA 
TAACCATGTT 
GCTGTCTCAT 
CTCTGTCTGA 
TTAAGTTTGA 
GATGTCAAGA 
GGATGGCCGC 
TTTTCACCTG 
TTGGCTTTTG 
CTTCCTCCAT 
TCCCTTTATC 
GACAGTGGGG 
AGACCACCCC 
CCGGACCCCT 
TCCACTACCT 
CCGTTCTCCT 
CCAGGTAAAC 
GACTGTCAGC 
GAGGCTAGAA 
GATGCCGCTT 
CACCTAGTCC 
ACCAATTTGG 
CTAGAGAGAC 
CAAGAAACTA 
TTAGAGAGGT 
AAGATCTTTA 
GAGGAAAAAG 
AGGAGACATA 
AGACAGGGAG 
GAAAAGGGGC 
CCCCAGACCT 
GAACCCAGGA 
GCCCAACACT 
CAAGGGGCTA 
ACCGGTAAGG 
AGAGATTTGC 
ATGGGACCAA 
CTACATGAGA 
CCTCAGGCCT 
ATACCTCTGA 
GCCAGACTGG 
TGCCAGTCCC 
AGGCCTGTCC 
CCCAACCCTT 
GATTTAAAGG 
TTTGAGTGGA 



GCTCTGATGC 
AGTAGTGCGC 
AGAATCTGCT 
GTTGACATTG 
GCCCATATAT 
CCAACGACCC 
GGACTTTCCA 
ATCAAGTGTA 
CCTGGCATTA 
TATTAGTCAT 
AGCGGTTTGA 
TTTGGCACCA 
AAATGGGCGG 
GAGAACCCAC 
GGGACCCTTG 
AAAGTTTTCA 
TGATAATTTT 
TTTTCATTTT 
TTCACTTTTG 
AATCAGGGTA 
TGATAAGGTA 
AGAAACAACT 
CTGTTTGAGA 
CATGCCTTCT 
CATTTTGGCA 
CTGTGTTTCT 
CCTTAGGTCA 
AGAGACGTTG 
GAGACGGCAC 
GCCCGCATGG 
ACCCCCCTCC 
CCGCCCCGTC 
CAGCCCTCAC 
GGCCGCTCAT 
CTTCCGACAG 
CCCCAATGGC 
CGCAGGCATT 
CTTCTGACCT 
TGACAGCTCT 
AGCTGTTGGG 
AGGCGGTGCG 
TTCCCCTCGA 
ACTATCGCCA 
CCAAGGTAAA 
TTAAGGAAGC 
ATGTGTCTAT 
TAGAAGATTT 
ATAAACGAGA 
AAGAACGCCG 
GAGAGATGAG 
GAGAACGAAG 
ACTGGGCTAA 
CCCTCCTGAC 
TAACCCTCAA 
CCGTGCTGAC 
CTGGAGGAAA 
TCACCCACTC 
TGACTAAACT 
TGGGGCAGCC 
CCTCAAAAGA 
GGGCGGAAAC 
AAGCAACCTC 
GGATCAAGCC 
CCTGGAACAC 
AGGATCTGAG 
ACAACCTCTT 
ATGCCTTTTT 
GAGATCCAGA 



CGCATAGTTA 
GAGCAAAATT 
TAGGGTTAGG 
ATTATTGACT 
GGAGTTCCGC 
CCGCCCATTG 
TTGACGTCAA 
TCATATGCCA 
TGCCCAGTAC 
CGCTATTACC 
CTCACGGGGA 
AAATCAACGG 
TAGGCGTGTA 
TGCTTAACTG 
ATTGTTCTTT 
GGGTGTTGTT 
GTTTCTTTCA 
CTGTAACTTT 
TTTATTTGTC 
TATTATATTG 
GAATATTTCT 
ACATCCTGGT 
TGAGGATAAA 
TCTTTTTCCT 
AGAATTGGCC 
GTATTTGTCT 
CTGGAAAGAT 
GGTTACCTTC 
CTTTAACCGA 
ACACCCAGAC 
CTGGGTCAAG 
TCTCCCCCTT 
TCCTTCTCTA 
CGACCTACTT 
GGACGGAAAT 
ATCTCGCCTA 
CCCCCTCCGC 
TTACAACTGG 
GATCGAGTCT 
GACTCTGCTG 
GGGCGATGAT 
GCGCCCAGAC 
GTTGCTCCTA 
AGGAATAACA 
CTATCGCAGG 
GTCTTTCATT 
AAAAAACAAG 
AACCCCGGAA 
TAGGACAGAG 
CAAGCTATTG 
GAGGTCCCAA 
AGATTGTCCC 
CCTAGATGAC 
AGTCGGGGGG 
CCAAAATCCT 
GCGGTATCGC 
TTTCCTCCAT 
AAAAGCCCAA 
CCTGCAAGTG 
GCCAGATGTT 
CGGGGGCATG 
TACCCCCGTG 
CCACATACAG 
GCCCCTGCTA 
AGAAGTCAAC 
GAGCGGGCTC 
CTGCCTGAGA 
GATGGGAATC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
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1080 
1140 
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1800 
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2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
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2760 
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2940 
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3720 
3780 
3840 
3900 
3960 
4020 
4080 
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Figure 7. hCMV+intron Sequence 2 

TCAGGACAAT TGACCTGGAC CAGACTCCCA CAGGGTTTCA AAAACAGTCC CACCCTGTT'r ' 4140 

GATGAGGCAC TGCACAGAGA CCTAGCAGAC TTCCGGATCC AGCACCCAGA CTTGATCCTG 4200 

CTACAGTACG TGGATGACTT ACTGCTGGCC GCCACTTCTG AGCTAGACTG CCAACAAGGT 4260 

ACTCGGGCCC TGTTACAAAC CCTAGGGAAC CTCGGGTATC GGGCCTCGGC CAAGAAAGCC 4320 

CAAATTTGCC AGAAACAGGT CAAGTATCTG GGGTATCTTC TAAAAGAGGG TCAGAGATGG 4380 

CTGACTGAGG CCAGAAAAGA GACTGTGATG GGGCAGCCTA CTCCGAAGAC CCCTCGACAA 4440 

CTAAGGGAGT TCCTAGGGAC GGCAGGCTTC TGTCGCCTCT GGATCCCTGG GTTTGCAGAA 4500 

ATGGCAGCCC CCTTGTACCC TCTCACCAAA ACGGGGACTC TGTTTAATTG GGGCCCAGAC 4560 

CAACAAAAGG CCTATCAAGA AATCAAGCAA GCTCTTCTAA CTGCCCCAGC CCTGGGGTTG 4620 

CCAGATTTGA CTAAGCCCTT TGAACTCTTT GTCGACGAGA AGCAGGGCTA CGCCAAAGGT 4680 

GTCCTAACGC AAAAACTGGG ACCTTGGCGT CGGCCGGTGG CCTACCTGTC CAAAAAGCTA 4740 

GACCCAGTAG CAGCTGGGTG GCCCCCTTGC CTACGGATGG TAGCAGCCAT TGCCGTACTG 4800 

ACAAAGGATG CAGGCAAGCT AACCATGGGA CAGCCACTAG TCATTCTGGC CCCCCATGCA 4860 

GTAGAGGCAC TAGTCAAACA ACCCCCCGAC CGCTGGCTTT CCAACGCCCG GATGACTCAC 4920 

TATCAGGCCT TGCTTTTGGA CACGGACCGG GTCCAGTTCG GACCGGTGGT AGCCCTGAAC 4980 

CCGGCTACGC TGCTCCCACT GCCTGAGGAA GGGCTGCAAC ACAACTGCCT TGATATCCTG 5040 

GCCGAAGCCC ACGGAACCdG ACCCGACCTA ACGGACCAGC CGCTCCCAGA CGCCGACCAC 5100 

ACCTGGTACA CGGATGGAAG CAGTCTCTTA CAAGAGGGAC AGCGTAAGGC GGGAGCTGCG 5160 

GTGACCACCG AGACCGAGGT AATCTGGGCT AAAGCCCTGC CAGCCGGGAC ATCCGCTCAG 5220 

CGGGCTGAAC TGATAGCACT CACCCAGGCC CTAAAGATGG CAGAAGGTAA GAAGCTAAAT 5280 

GTTTATACTG ATAGCCGTTA TGCTTTTGCT ACTGCCCATA TCCATGGAGA AATATACAGA 5340 

AGGCGTGGGT TGCTCACATC AGAAGGCAAA GAGATCAAAA ATAAAGACGA GATCTTGGCC 5400 

CTACTAAAAG CCCTCTTTCT GCCCAAAAGA CTTAGCATAA TCCATTGTCC AGGACATCAA 5460 

AAGGGACACA GCGCCGAGGC TAGAGGCAAC CGGATGGCTG ACCAAGCGGC CCGAAAGGCA 5520 

GCCATCACAG AGACTCCAGA CACCTCTACC CTCCTCATAG AAAATTCATC ACCCTACACC 5580 

TCAGAACATT TTCATTACAC AGTGACTGAT ATAAAGGACC TAACCAAGTT GGGGGCCATT 5640 

TATGATAAAA CAAAGAAGTA TTGGGTCTAC CAAGGAAAAC CTGTGATGCC TGACCAGTTT 5700 

ACTTTTGAAT TATTAGACTT TCTTCATCAG CTGACTCACC TCAGCTTCTC AAAAATGAAG 5760 

GCTCTCCTAG AGAGAAGCCA CAGTCCCTAC TACATGCTGA ACCGGGATCG AACACTCAAA 5820 

AATATCACTG AGACCTGCAA AGCTTGTGCA CAAGTCAACG CCAGCAAGTC TGCCGTTAAA 5880 

CAGGGAACTA GGGTCCGCGG GCATCGGCCC GGCACTCATT GGGAGATCGA TTTCACCGAG 5940 

ATAAAGCCCG GATTGTATGG CTATAAATAT CTTCTAGTTT TTATAGATAC CTTTTCTGGC 6000 

TGGATAGAAG CCTTCCCAAC CAAGAAAGAA ACCGCCAAGG TCGTAACCAA GAAGCTACTA 6060 

GAGGAGATCT TCCCCAGGTT CGGCATGCCT CAGGTATTGG GAACTGACAA TGGGCCTGCC 6120 

TTCGTCTCCA AGGTGAGTCA GACAGTGGCC GATCTGTTGG GGATTGATTG GAAATTACAT 6180 

TGTGCATACA GACCCCAAAG CTCAGGCCAG GTAGAAAGAA TGAATAGAAC CATCAAGGAG 6240 

ACTTTAACTA AATTAACGCT TGCAACTGGC TCTAGAGACT GGGTGCTCCT ACTCCCCTTA 6300 

GCCCTGTACC GAGCCCGCAA CACGCCGGGC CCCCATGGCC TCACCCCATA TGAGATCTTA 63 60 

TATGGGGCAC CCCCGCCCCT TGTAAACTTC CCTGACCCTG ACATGACAAG AGTTACTAAC 6420 

AGCCCCTCTC TCCAAGCTCA CTTACAGGCT CTCTACTTAG TCCAGCACGA AGTCTGGAGA 6480 

CCTCTGGCGG CAGCCTACCA AGAACAACTG GACCGACCGG TGGTACCTCA CCCTTACCGA 6540 

GTCGGCGACA CAGTGTGGGT CCGCCGACAC CAGACTAAGA ACCTAGAACC TCGCTGGAAA 6600 

GGACCTTACA CAGTCCTGCT GACCACCCCC ACCGCCCTCA AAGTAGACGG CATCGCAGCT 6660 

TGGATACACG CCGCCCACGT GAAGGCTGCC GACCCCGGGG GTGGACCATC CTCTAGACTG 6720 

ACATGGCGCG TTCAACGCTC TCAAAACCCC TTAAAAATAA GGTTAACCCG CGAGGCCCCC 6780 

TAATCCCCTT AATTCTTCTG ATGCTCAGAG GGGTCAGTAC TGCTTCGCCC GGCTCCAGTG 6840 

CGGCCCAGCC GGCCACCATG AAAACATTTA ACATTTCTCA ACAAGATCTA GAATTAGTAG 6900 

AAGTAGCGAC AGAGAAGATT ACAATGCTTT ATGAGGATAA TAAACATCAT GTGGGAGCGG 6960 

CAATTCGTAC GAAAACAGGA GAAATCATTT CGGCAGTACA TATTGAAGCG TATATAGGAC 7020 

GAGTAACTGT TTGTGCAGAA GCCATTGCGA TTGGTAGTGC AGTTTCGAAT GGACAAAAGG 7080 

ATTTTGACAC GATTGTAGCT GTTAGACACC CTTATTCTGA CGAAGTAGAT AGAAGTATTC 7140 

GAGTGGTAAG TCCTTGTGGT ATGTGTAGGG AGTTGATTTC AGACTATGCA CCAGATTGTT 7200 

TTGTGTTAAT AGAAATGAAT GGCAAGTTAG TCAAAACTAC GATTGAAGAA CTCATTCCAC 72 60 

TCAAATATAC CCGAAATTAA AAGTTTTACC ACCAAGCTTA TCGAATTC 73 08 
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Figure 8. hCMV+intronkaSD Sequence 



AGATCTCCCG ATCCCCTATG GTCGACTCTC AGTACAATCT GCTCTGATGC CGCATAGTTA 
AGCCAGTATC TGCTCCCTGC TTGTGTGTTG GAGGTCGCTG AGTAGTGCGC GAGCAAAATT 
TAAGCTACAA CAAGGCAAGG CTTGACCGAC AATTGCATGA AGAATCTGCT TAGGGTTAGG 
CGTTTTGCGC TGCTTCGCGA TGTACGGGCC AGATATACGC GTTGACATTG ATTATTGACT 
AGTTATTAAT AGTAATCAAT TACGGGGTCA TTAGTTCATA GCCCATATAT GGAGTTCCGC 
GTTACATAAC TTACGGTAAA TGGCCCGCCT GGCTGACCGC CCAACGACCC CCGCCCATTG 
ACGTCAATAA TGACGTATGT TCCCATAGTA ACGCCAATAG GGACTTTCCA TTGACGTCAA 
TGGGTGGACT ATTTACGGTA AACTGCCCAC TTGGCAGTAC ATCAAGTGTA TCATATGCCA 
AGTACGCCCC CTATTGACGT CAATGACGGT AAATGGCCCG CCTGGCATTA TGCCCAGTAC 
ATGACCTTAT GGGACTTTCC TACTTGGCAG TACATCTACG TATTAGTCAT CGCTATTACC 
ATGGTGATGC GGTTTTGGCA GTACATCAAT GGGCGTGGAT AGCGGTTTGA CTCACGGGGA 
TTTCCAAGTC TCCACCCCAT TGACGTCAAT GGGAGTTTGT TTTGGCACCA AAATCAACGG 
GACTTTCCAA AATGTCGTAA CAACTCCGCC CCATTGACGC AAATGGGCGG TAGGCGTGTA 
CGGTGGGAGG TCTATATAAG CAGAGCTCTC TGGCTAACTA GAGAACCCAC TGCTTAACTG 
GCTTATCGAA ATGTCGACTG AGAACTTCAG GGTGAGTTTG GGGACCCTTG ATTGTTCTTT 
CTTTTTCGCT ATTGTAAAAT TCATGTTATA TGGAGGGGGC AAAGTTTTCA GGGTGTTGTT 
TAGAATGGGA AGATGTCCCT TGTATCACCA TGGACCCTCA TGATAATTTT GTTTCTTTCA 
CTTTCTACTC TGTTGACAAC CATTGTCTCC TCTTATTTTC TTTTCATTTT CTGTAACTTT 
TTCGTTAAAC TTTAGCTTGC ATTTGTAACG AATTTTTAAA TTCACTTTTG TTTATTTGTC 
AGATTGTAAG TACTTTCTCT AATCACTTTT TTTTCAAGGC AATCAGGGTA TATTATATTG 
TACTTCAGCA CAGTTTTAGA GAACAATTGT TATAATTAAA TGATAAGGTA GAATATTTCT 
GCATATAAAT TCTGGCTGGC GTGGAAATAT TCTTATTGGT AGAAACAACT ACATCCTGGT 
CATCATCCTG CCTTTCTCTT TATGGTTACA ATGATATACA CTGTTTGAGA TGAGGATAAA 
ATACTCTGAG TCCAAACCGG GCCCCTCTGC TAACCATGTT CATGCCTTCT TCTTTTTCCT 
ACAGCTCCTG GGCAACGTGC TGGTTGTTGT GCTGTCTCAT CATTTTGGCA AGAATTGGCC 
GCAAGCTTCT GCAGCATCGT TCTGTGTTGT CTCTGTCTGA CTGTGTTTCT GTATTTGTCT 
GAGAATATGG GCCAGACTGT TACCACTCCC TTAAGTTTGA CCTTAGGTCA CTGGAAAGAT 
GTCGAGCGGA TCGCTCACAA CCAGTCGGTA GATGTCAAGA AGAGACGTTG GGTTACCTTC 
TGCTCTGCAG AATGGCCAAC CTTTAACGTC GGATGGCCGC GAGACGGCAC CTTTAACCGA 
GACCTCATCA CCCAGGTTAA GATCAAGGTC TTTTCACCTG GCCCGCATGG ACACCCAGAC 
CAGGTCCCCT ACATCGTGAC CTGGGAAGCC TTGGCTTTTG ACCCCCCTCC CTGGGTCAAG 
CCCTTTGTAC ACCCTAAGCC TCCGCCTCCT CTTCCTCCAT CCGCCCCGTC TCTCCCCCTT 
GAACCTCCTC GTTCGACCCC GCCTCGATCC TCCCTTTATC CAGCCCTCAC TCCTTCTCTA 
GGCGCCAAAC CTAAACCTCA AGTTCTTTCT GACAGTGGGG GGCCGCTCAT CGACCTACTT 
ACAGAAGACC CCCCGCCTTA TAGGGACCCA AGACCACCCC CTTCCGACAG GGACGGAAAT 
GGTGGAGAAG CGACCCCTGC GGGAGAGGCA CCGGACCCCT CCCCAATGGC ATCTCGCCTA 
CGTGGGAGAC GGGAGCCCCC TGTGGCCGAC TCCACTACCT CGCAGGCATT CCCCCTCCGC 
GCAGGAGGAA ACGGACAGCT TCAATACTGG CCGTTCTCCT CTTCTGACCT TTACAACTGG 
AAAAATAATA ACCCTTCTTT TTCTGAAGAT CCAGGTAAAC TGACAGCTCT GATCGAGTCT 
GTTCTCATCA CCCATCAGCC GACCTGGGAC GACTGTCAGC AGCTGTTGGG GACTCTGCTG 
ACCGGAGAAG AAAAACAACG GGTGCTCTTA GAGGCTAGAA AGGCGGTGCG GGGCGATGAT 
GGGCGCCCCA CTCAACTGCC CAATGAAGTC GATGCCGCTT TTCCCCTCGA GCGCCCAGAC 
TGGGATTACA CCACCCAGGC AGGACGCAAC CACCTAGTCC ACTATCGCCA GTTGCTCCTA 
GCGGGTCTCC AAAACGCGGG CAGAAGCCCC ACCAATTTGG CCAAGGTAAA AGGAATAACA 
CAAGGGCCCA ATGAGTCTCC CTCGGCCTTC CTAGAGAGAC TTAAGGAAGC CTATCGCAGG 
TACACTCCTT ATGACCCTGA GGACCCAGGG CAAGAAACTA ATGTGTCTAT GTCTTTCATT 
TGGCAGTCTG CCCCAGACAT TGGGAGAAAG TTAGAGAGGT TAGAAGATTT AAAAAACAAG 
ACGCTTGGAG ATTTGGTTAG AGAGGCAGAA AAGATCTTTA ATAAACGAGA AACCCCGGAA 
GAAAGAGAGG AACGTATCAG GAGAGAAACA GAGGAAAAAG AAGAACGCCG . TAGGACAGAG 
GATGAGCAGA AAGAGAAAGA AAGAGATCGT AGGAGACATA GAGAGATGAG CAAGCTATTG 
GCCACTGTCG TTAGTGGACA GAAACAGGAT AGACAGGGAG GAGAACGAAG GAGGTCCCAA 
CTCGATCGCG ACCAGTGTGC CTACTGCAAA GAAAAGGGGC ACTGGGCTAA AGATTGTCCC 
AAGAAACCAC GAGGACCTCG GGGACCAAGA CCCCAGACCT CCCTCCTGAC CCTAGATGAC 
TAGGGAGGTC AGGGTCAGGA GCCCCCCCCT GAACCCAGGA TAACCCTCAA AGTCGGGGGG 
CAACCCGTCA CCTTCCTGGT AGATACTGGG GCCCAACACT CCGTGCTGAC CCAAAATCCT 
GGACCCCTAA GTGATAAGTC TGCCTGGGTC CAAGGGGCTA CTGGAGGAAA GCGGTATCGC 
TGGACCACGG ATCGCAAAGT ACATCTAGCT ACCGGTAAGG TCACCCACTC TTTCCTCCAT 
GTACCAGACT GTCCCTATCC TCTGTTAGGA AGAGATTTGC TGACTAAACT AAAAGCCCAA 
ATCCACTTTG AGGGATCAGG AGCTCAGGTT ATGGGACCAA TGGGGCAGCC CCTGCAAGTG 
TTGACCCTAA ATATAGAAGA TGAGCATCGG CTACATGAGA CCTCAAAAGA GCCAGATGTT 
TCTCTAGGGT CCACATGGCT GTCTGATTTT CCTCAGGCCT GGGCGGAAAC CGGGGGCATG 
GGACTGGCAG TTCGCCAAGC TCCTCTGATC ATACCTCTGA AAGCAACCTC TACCCCCGTG 
TCCATAAAAC AATACCCCAT GTCACAAGAA GCCAGACTGG GGATCAAGCC CCACATACAG 
AGACTGTTGG ACCAGGGAAT ACTGGTACCC TGCCAGTCCC CCTGGAACAC GCCCCTGCTA 
CCCGTTAAGA AACCAGGGAC TAATGATTAT AGGCCTGTCC AGGATCTGAG AGAAGTCAAC 
AAGCGGGTGG AAGACATCCA CCCCACCGTG CCCAACCCTT ACAACCTCTT GAGCGGGCTC 
CCACCGTCCC ACCAGTGGTA CACTGTGCTT GATTTAAAGG ATGCCTTTTT CTGCCTGAGA 
CTCCACCCCA CCAGTCAGCC TCTCTTCGCC TTTGAGTGGA GAGATCCAGA GATGGGAATC 
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Figure 8, hCMV+intronkaSD Sequence 2 

TCAGGACAAT TGACCTGGAC CAGACTCCCA CAGGGTTTCA AAAACAGTCC CACCCTGTTT ' 4140 
GATGAGGCAC TGCACAGAGA CCTAGCAGAC TTCCGGATCC AGCACCCAGA CTTGATCCTG * 4200 

CTACAGTACG TGGATGACTT ACTGCTGGCC GGCACTTCTG AGCTAGACTG CCAACAAGGT 4260 

ACTCGGGCCC TGTTACAAAC CCTAGGGAAC CTCGGGTATC GGGCCTCGGC CAAGAAAGCC 4320 

CAAATTTGCC AGAAACAGGT CAAGTATCTG GGGTATCTTC TAAAAGAGGG TCAGAGATGG 4380 

CTGACTGAGG CCAGAAAAGA GACTGTGATG GGGCAGCCTA CTCCGAAGAC CCCTCGACAA 4440 

CTAAGGGAGT TCCTAGGGAC GGCAGGCTTC TGTCGCCTCT GGATCCCTGG GTTTGCAGAA 4500 

ATGGCAGCCC CCTTGTACCC TCTCACCAAA ACGGGGACTC TGTTTAATTG GGGCCC^GAC 4560 

CAACAAAAGG CCTATCAAGA AATCAAGCAA GCTCTTCTAA CTGCCCCAGC CCTGGGGTTG 4620 

CCAGATTTGA CTAAGCCCTT TGAACTCTTT GTCGACGAGA AGCAGGGCTA CGCCAA^GGT 4680 

GTCCTAACGC AAAAACTGGG ACCTTGGCGT CGGCCGGTGG CCTACCTGTC CAAAAAGCTA 4740 

GACCCAGTAG CAGCTGGGTG GCCCCCTTGC CTACGGATGG TAGCAGCCAT TGCCGTACTG 4800 

ACAAAGGATG CAGGCAAGCT AACCATGGGA CAGCCACTAG TCATTCTGGC CCCCCATGCA 4860 

GTAGAGGCAC TAGTCAAACA ACCCCCCGAC CGCTGGCTTT CCAACGCCCG GATGACTCAC 4920 

TATCAGGCCT TGCTTTTGGA CACGGACCGG GTCCAGTTCG GACCGGTGGT AGCCCTGAAC 4980 

CCGGCTACGC TGCTCCCACT GCCTGAGGAA GGGCTGCAAC ACAACTGCCT TGATATCCTG 5040 

GCCGAAGCCC ACGGAACCCG ACCCGACCTA ACGGACCAGC CGCTCCCAGA CGCCGACCAC 5100 

ACCTGGTACA CGGATGGAAG CAGTCTCTTA CAAGAGGGAC AGCGTAAGGC GGGAGC TGCG 5160 

GTGACCACCG AGACCGAGGT AATCTGGGCT AAAGCCCTGC CAGCCGGGAC ATCCGCTCAG 5220 

CGGGCTGAAC TGATAGCACT CACCCAGGCC CTAAAGATGG CAGAAGGTAA GAAGCTAAAT 5280 

GTTTATACTG ATAGCCGTTA TGCTTTTGCT ACTGCCCATA TCCATGGAGA AATATACAGA 5340 

AGGCGTGGGT TGCTCACATC AGAAGGCAAA GAGATCAAAA ATAAAGACGA GATCTTGGCC 5400 

CTACTAAAAG CCCTCTTTCT GCCCAAAAGA CTTAGCATAA TCCATTGTCC AGGACATCAA 5460 

AAGGGACACA GCGCCGAGGC TAGAGGCAAC CGGATGGCTG ACCAAGCGGC CCGAAAGGCA 5520 

GCCATCACAG AGACTCCAGA CACCTCTACC CTCCTCATAG AAAATTCATC ACCCTACACC 5580 

TCAGAACATT TTCATTACAC AGTGACTGAT ATAAAGGACC TAACCAAGTT GGGGGCCATT 5640 

TATGATAAAA CAAAGAAGTA TTGGGTCTAC CAAGGAAAAC CTGTGATGCC TGACCAGTTT 5700 

ACTTTTGAAT TATTAGACTT TCTTCATCAG CTGACTCACC TCAGCTTCTC AAAAATGAAG 5760 

GCTCTCCTAG AGAGAAGCCA CAGTCCCTAC TACATGCTGA ACCGGGATCG AACACTCAAA 5820 

AATATCACTG AGACCTGCAA AGCTTGTGCA CAAGTCAACG CCAGCAAGTC TGCCGTTAAA 5880 

CAGGGAACTA GGGTCCGCGG GCATCGGCCC GGCACTCATT GGGAGATCGA TTTCACCGAG 5940 

ATAAAGCCCG GATTGTATGG CTATAAATAT CTTCTAGTTT TTATAGATAC CTTTTCTGGC 6000 

TGGAT AGAAG CCTTCCCAAC CAAGAAAGAA ACCGCCAAGG TCGTAACCAA GAAGCTACTA 6060 

GAGGAGATCT TCCCCAGGTT CGGCATGCCT CAGGTATTGG GAACTGACAA TGGGCCTGCC 6120 

TTCGTCTCCA AGGTGAGTCA GACAGTGGCC GATCTGTTGG GGATTGATTG GAAATTACAT 6180 ' 

TGTGCATACA GACCCCAAAG CTCAGGCCAG GTAGAAAGAA TGAATAGAAC CATCAAGGAG 6240 

ACTTTAACTA AATTAACGCT TGCAACTGGC TCTAGAGACT GGGTGCTCCT ACTCCCCTTA 6300 

GCCCTGTACC GAGCCCGCAA CACGCCGGGC CCCCATGGCC TCACCCCATA TGAGATCTTA 6360 

TATGGGGCAC CCCCGCCCCT TGTAAACTTC CCTGACCCTG ACATGACAAG AGTTACTAAC 6420 

AGCCCCTCTC TCCAAGCTCA CTTACAGGCT CTCTACTTAG TCCAGCACGA AGTCTGGAGA 6480 

CCTCTGGCGG CAGCCTACCA AGAACAACTG GACCGACCGG TGGTACCTCA CCCTTACCGA 6540 

GTCGGCGACA CAGTGTGGGT CCGCCGACAC CAGACTAAGA ACCTAGAACC TCGCTGGAAA 6600 

GGACCTTACA CAGTCCTGCT GACCACCCCC ACCGCCCTCA AAGTAGACGG CATCGCAGCT 6660 

TGGATACACG CCGCCCACGT GAAGGCTGCC GACCCCGGGG GTGGACCATC CTCTAGACTG 6720 

ACATGGCGCG TTCAACGCTC TCAAAACCCC TTAAAAATAA GGTTAACCCG CGAGGCCCCC 6780 

TAATCCCCTT AATTCTTCTG ATGCTCAGAG GGGTCAGTAC TGCTTCGCCC GGCTCCAGTG 6840 

CGGCCCAGCC GGCCACCATG AAAACATTTA ACATTTCTCA ACAAGATCTA GAATTAGTAG 6900 

AAGTAGCGAC AGAGAAGATT ACAATGCTTT ATGAGGATAA TAAACATCAT GTGGGAGCGG 6960 

CAATTCGTAC GAAAACAGGA GAAATCATTT CGGCAGTACA TATTGAAGCG . TATATAGGAC 7020 

GAGTAACTGT TTGTGCAGAA GCCATTGCGA TTGGTAGTGC AGTTTCGAAT GGACAAAAGG 7080 

ATTTTGACAC GATTGTAGCT GTTAGACACC CTTATTCTGA CGAAGTAGAT AGAAGTATTC 7140 

GAGTGGTAAG TCCTTGTGGT ATGTGTAGGG AGTTGATTTC AGACTATGCA CCAGATTGTT 7200 

TTGTGTTAAT AGAAATGAAT GGCAAGTTAG TCAAAACTAC GATTGAAGAA CTCATTCCAC 7260 

TCAAATATAC CCGAAATTAA AAGTTTTACC ACCAAGCTTA TCGAATTC 7308 
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CATATGCGGT 
TCGCCATTCA 
CGCCAGCTGG 
TCCCAGTCAC 
GACAGGATCT 
CTAGAATACG 
GAAAGACCCC 
GAAAAATACC 
AACGTTGGGC 
ACAGATGGTC 
GGCCCAACCC 
TGAAATGACC 
GCGCTTCTGC 
CTCCGATAGA 
CCGACTCGTG 
CGGGGGTCTT 
ACCCACCACC 
ACTTCCCTGA 
AGGCTCTCTA 
AACTGGACCG 
GACATCAAAC 
CCCCCACCGC 
CGGCCGACAC 
ACCCCCTCAA 
AGTAGGGATG 
GATGACTGGG 
AAAATTATAT 
ACCGTATGTC 
CTTTTACGTG 
CTGTGGTAAA 
GGACCTAATC 
CTGTGGCCCC 
GGGCAGATGC 
CGGGCCCAAA 
CTCCCTGACC 
ATTACCCGAC 
TAGCCCCCTC 
AAGTCCAAGT 
AGGAGCCTAT 
CTTAGTGTCG 
TTCCACCGCT 
GACAGGACAG 
CACCCAAAGC 
TTGCAGCACT 
TTGTGTATTA 
TCAGCTTGAA 
ACTAGGAGGA 
AATTAAAACC 
CGAAAAGTCA 
GAACCGCAGA 
AGAAGAATGT 
AAGAGAAAGG 
GCTGTTTAAT 
AGTACTCTTA 
TAAAGACAGG 
GCCTATAGAG 
CGGCATAGTA 
TTCCGGTGCT 
GGTTCTCCCG 
TGTTCATCAG 
TGCGCGGCCT 
ACGCCTCCGG 
TGCGCGACCC 
ACCGGTCGAC 
ATCACAAATT 
CTCATCAATG 
TNNNNACATG 
GGCGTTTTTC 



GTGAAATACC 
GGCTGCGCAA 
CGAAAGGGGG 
GACGTTGTAA 
CAGTAGTCCA 
AGCCACAATA 
ACCAAATTGC 
AAACCAAGAA 
CAAACAGGAT 
ACCGCGGTTC 
TCAGCAGTTT 
CTGTGCCTTA 
TTCCCGAGCT 
CTGAGTCGCC 
GTCTCGCTGT 
TCATTTGGGG 
GGGAGGTAAG 
CCCTGACATG 
CTTAGTCCAG 
GCCGGTGGTG 
CAAGAACCTA 
CCTCAAAGTA 
CGAGAGTGGA 
GATAAGATTA 
GCAGAGAGCC 
CGTACCGCCA 
TTTGATCTAT 
GGGTATGGCT 
TGCCCTGGGC 
TGGGGGTGTG 
TCCCTTAAGC 
TGCTACGACC 
AACCCTCTAG 
TCGTGGGGAC 
CGGCAGGTCC 
CAAAGACTCC 
AATACCAGTT 
GTCCCACAGC 
CAGGCGCTTA 
GGACCTCCTT 
CCGGCCAACT 
GGCCTATGCA 
GCCGGCTCAG 
GGATTGACTC 
GTTGAACTCT 
CAGCGTACCA 
TTAACCATGG 
CAGCAGTTTG 
ATTACCAACC 
GGCCTAGATT 
TGTTTTTATG 
CTTAATCAGA 
AGATCCCCCT 
CTGATCTTAC 
ATCTCAGTAG 
TACGAGCCAT 
TAATACGACT 
CACCGCGCGC 
GGACTTCGTG 
CGCGGTCCAG 
GGACGAGCTG 
GCCGGCCATG 
GGCCGGCAAC 
TTGTTAACTT 
TCACAAATAA 
TATCTTATCA 
TGAGCAAAAG 
CATAGGCTCC 



GCACAGATGC 
CTGTTGGGAA 
ATGTGCTGCA 
AACGACGGCC 
GGCTTTAGTC 
AATAAAAGAT 
TTAGCCTGAT 
TAGAGAAGTT 
ATCTGCGGTG 
GGCCCCGGCC 
CTTAAGACCC 
TTTGAATTAA 
CTATAAAAGA 
CGGGTACCCG 
TCCTTGGGAG 
GCTCGTCCGG 
CTGGCCAAGA 
ACCAGAGTTA 
CACGAAGTTT 
CCTCACCCTT 
GAACCTCGCT 
GACGGTATCG 
CCATCCTCTG 
ACCCGTGGAA 
CCCATCAGGT 
ATGCCACCTC 
GTGATCTGGT 
GCAAGTACCC 
ATACCGTAAA 
AAACCACCGG 
GCGGTAACAC 
TCTCCAAAGT 
TCCTAGAATT 
TGAGACTGTA 
TTAATGTGGG 
CTTCCTCACC 
ACCCCCCTTC 
CACCCCCAGG 
ACCTCACCAA 
ATTACGAAGG 
GTACGGCCAC 
TGGGGGCAGT 
GATCCTACTA 
CCTGCTTGTC 
GGCCCAGAGT 
AATATAAAAG 
GAGGGATTGC 
AGCAGCTTCA 
TAGAAAAGTC 
TGCTATTCCT 
CAGACCACAC 
GACAAAAACT 
GGTTTACCAC 
TCTTTGGACC 
TCCAGGCTTT 
AGGGCGCCTA 
CACTATAGGA 
GACGTCGCCG 
GAGGACGACT 
GACCAGGTGG 
TACGCCGAGT 
ACCGAGATCG 
TGCGTGCACT 
GTTTATTGCA 
AGCATTTTTT 
TGTCTGGATC 
GCCAGCAAAA 
GCCCCCCTGA 



GTAAGGAGAA 
GGGCGATCGG 
AGGCGATTAA 
AGTGAATTCC 
CTGACTCAAC 
TTTATTTAGT 
AGCCGCAGTA 
CAGATCAAGG 
AGCAGTTTCG 
CGGGGCCAAG 
ATCAGATGTT 
CCAATCAGCC 
GCTCACAACC 
TGTATCCAAT 
GGTCTCCTCA 
GATCTGGAGA 
TCTTATATGG 
CTAACAGCCC 
GGAGACCACT 
ACCGGGTCGG 
GGAAAGGACC 
CAGCTTGGAT 
GACGGACATG 
GCCCTTAATA 
CTTTAATGTA 
CCTCCTGGGA 
CGGAGAGGAG 
CGCAGGGAGA 
GTCGGGGTGT 
ACAGGCTTAC 
CCCCTGGGAC 
ATCCAATTCC 
CACTGATGCA 
CCGGACAGGA 
ACCCCGAGTC 
AATAGAGATT 
CACTACCAGT 
AACTGGAGAT 
TCCCGACAAG 
AGTAGCGGTC 
TTCCCAACAT 
ACCTAAAACT 
CCTTGCAGCA 
CACCACGGTG 
AATTTACCAC 
AGAGCCAGTA 
AGCTGGAATA 
TGCCGCTATC 
ACTGACCTCG 
AAAGGAGGGA 
GGGGCTAGTG 
ATTTGAGACA 
CTTAATCTCC 
TTGCATTCTC 
AGTCCTGACT 
GTGTTGACAA 
GGGCCACCAT 
GAGCGGTCGA 
TCGCCGGTGT 
TGCCGGACAA 
GGTCGGAGGT 
GCGAGCAGCC 
TCGTGGCCGA 
GCTTATAATG 
TCACTGCATT 
CAGATCTGGG 
GGCCAGGAAC 
CGAGCATCAC 



AATACCGCAT 
TGCGGGCCTC 
GTTGGGTAAC 
GATTAGTTCA 
AATACCACCA 
TTCCAGAAAA 
ACGCCATTTT 
GCGGGTACAC 
GCCCCGGCCC 
AACAGATGGT 
TCCAGGCTCC 
TGCTTCTCGC 
CCTCACTCGG 
AAATCCTCTT 
GAGTGATTGA 
CCCCTGCCCA 
GGCACCCCCG 
CTCTCTCCAA 
GGCGGCAGCT 
CGACACAGTG 
TTACACAGTC 
ACACGCAGCC 
GCGCGTTCAA 
GTCATGGGAG 
ACCTGGAGAG 
ACTGTACAAG 
TGGGACCCTT 
CAGCGGACCC 
GGGGGACCAG 
TGGAAGCCCA 
ACGGGATGCT 
TTCCAAGGGG 
GGAAAAAAGG 
ACAGATCCTA 
CCCATAGGGC 
GTACCGGCTC 
ACACCCTCAA 
AGACTACTAG 
ACCCAAGAAT 
GTGGGCACTT 
AAGCTTACCC 
CACCAGGCCT 
CCCGCCGGAA 
CTCAATCTAA 
TCCCCCGATT 
TCATTGACCC 
GGGACGGGGA 
CAGACAGACC 
TTGTCTGAAG 
GGTCTCTGCG 
AGAGACAGCA 
GGCCAAGGAT 
ACCATCATGG 
AATCGATTAG 
CAACAATACC 
TTAATCATCG 
GGCCAAGTTG 
GTTCTGGACC 
GGTCCGGGAC 
CACCCTGGCC 
CGTGTCCACG 
GTGGGGGCGG 
GGAGCAGGAC 
GTTACAAATA 
CTAGTTGTGG 
CCCATGCGGC 
CGTAAAAAGG 
AAAAATCGAC 



CAGGCGCCAT 
TTCGCTATTA 
GCCAGGGTTT 
ATTTGTTAAA 
GCTAAAACCA 
AGGGGGGAAT 
GCAAGGCATG 
GAAAACAGCT 
GGGGCCAAGA 
CCCCAGATAT 
CCCAAGGACC 
TTCTGTTCGC 
CGCGCCAGTC 
GCTGTTGCAT 
CTACCCGTCT 
GGGACCACCG 
CCCCTTGTAA 
GCTCACTTAC 
TACCAAGAAC 
TGGGTCCGCC 
CTGCTGACCA 
CACGTAAAGG 
CGCTCTCAAA 
TCCTGTTAGG 
TCACCAACCT 
ATGCCTTCCC 
CAGACCAGGA 
GGACTTTTGA 
GAGAGGGCTA 
CATCATCGTG 
CTAAAGTTGC 
CTACTCGAGG 
CTAACTGGGA 
TTACCATGTT 
CCAACCCAGT 
CACAGCCACC 
CCTCCCCTAC 
CTCTAGTCAA 
GTTGGCTGTG 
ATACCAATCA 
TATCTGAAGT 
TATGTAACAC 
CAATGTGGGC 
CCACAGATTA 
ATATGTATGG 
TGGCCCTTCT 
CCACTGCCTT 
TCAACGAAGT 
TAGTCCTACA 
CAGCCCTAAA 
TGGCCAAATT 
GGTTCGAAGG 
GACCTCTAAT 
TTCAATTTGT 
ACCAGCTAAA 
GCATAGTATA 
ACCAGTGCCG 
GACCGGCTCG 
GACGTGACCC 
TGGGTGTGGG 
AACTTCCGGG 
GAGTTCGCCC 
TGANNNNCGG 
AAGCAATAGC 
TTTGTCCAAA 
CGCGGATCGA 
CCGCGTTGCT 
GCTCAAGTCA 
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GAGGTGGCGA 
CGTGCGCTCT 
GGGAAGCGTG 
TCGCTCCAAG 
CGGTAACTAT 
CACTGGTAAC 
GTGGCCTAAC 
AGTTACCTTC 
CGGTGGTTTT 
TCCTTTGATC 
TTTGGTCATG 
TTTTAAATCA 
CAGTGAGGCA 
CGTCGTGTAG 
ACCGCGAGAC 
GGCCGAGCGC 
CCGGGAAGCT 
TACAGGCATC 
ACGATCAAGG 
TCCTCCGATC 
ACTGCATAAT 
CTCAACCAAG 
AATACGGGAT 
TTCTTCGGGG 
CACTCGTGCA 
AAAAACAGGA 
ACTCATACTC 
CGGATACATA 
CCGAAAAGTG 
TAGGCGTATC 
ACACATGCAG 
AGCCCGTCAG 
ATCAGAGCAG 



AACCCGACAG 

CCTGTTCCGA 

GCGCTTTCTC 

CTGGGCTGTG 

CGTCTTGAGT 

AGGATTAGCA 

TACGGCTACA 

GGAAAAAGAG 

TTTGTTTGCA 

TTTTCTACGG 

AGATTATCAA 

ATCTAAAGTA 

CCTATCTCAG 

ATAACTACGA 

CCACGCTCAC 

AGAAGTGGTC 

AGAGTAAGTA 

GTGGTGTCAC 

CGAGTTACAT 

GTTGTCAGAA 

TCTCTTACTG 

TCATTCTGAG 

AATACCGCGC 

CGAAAACTCT 

CCCAACTGAT 

AGGCAAAATG 

TTCCTTTTTC 

TTTGAATGTA 

CCACCTGACG 

ACGAGGCCCT 

CTCCCGGAGA 

GGCGCGTCAG 

ATTGTACTGA 



GACTATAAAG 

CCCTGCCGCT 

AATGCTCACG 

TGCACGAACC 

CCAACCCGGT 

GAGCGAGGTA 

CTAGAAGGAC 

TTGGTAGCTC 

AGCAGCAGAT 

GGTCTGACGC 

AAAGGATCTT 

TATATGAGTA 

CGATCTGTCT 

TACGGGAGGG 

CGGCTCCAGA 

CTGCAACTTT 

GTTCGCCAGT 

GCTCGTCGTT 

GATCCCCCAT 

GTAAGTTGGC 

TCATGCCATC 

AATAGTGTAT 

CACATAGCAG 

CAAGGATCTT 

CTTCAGCATC 

CCGCAAAAAA 

AATATTATTG 

TTTAGAAAAA 

TCTAAGAAAC 

TTCGTCTCGC 

CGGTCACAGC 

CGGGTGTTGG 

GAGTGCAC 



ATACCAGGCG 

TACCGGATAC 

CTGTAGGTAT 

CCCCGTTCAG 

AAGACACGAC 

TGTAGGCGGT 

AGTATTTGGT 

TTGATCCGGC 

TACGCGCAGA 

TCAGTGGAAC 

CACCTAGATC 

AACTTGGTCT 

ATTTCGTTCA 

CTTACCATCT 

TTTATCAGCA 

ATCCGCCTCC 

TAATAGTTTG 

TGGTATGGCT 

GTTGTGCAAA 

CGCAGTGTTA 

CGTAAGATGC 

GCGGCGACCG 

AACTTTAAAA 

ACCGCTGTTG 

TTTTACTTTC 

GGGAATAAGG 

AAGCATTTAT 

TAAACAAATA 

CATTATTATC 

GCGTTTCGGT 

TTGTCTGTAA 

CGGGTGTCGG 



TTTCCCCCTG 

CTGTCCGCCT 

CTCAGTTCGG 

CCCGACCGCT 

TTATCGCCAC 

GCTACAGAGT 

ATCTGCGCTC 

AAACAAACCA 

AAAAAAGGAT 

GAAAACTCAC 

CTTTTAAATT 

GACAGTTACC 

TCCATAGTTG 

GGCCCCAGTG 

ATAAACCAGC 

ATCCAGTCTA 

CGCAACGTTG 

TCATTCAGCT 

AAAGCGGTTA 

TCACTCATGG 

TTTTCTGTGA 

AGTTGCTCTT 

GTGCTCATCA 

AGATCCAGTT 

ACCAGCGTTT 

GCGACACGGA 

CAGGGTTATT 

GGGGTTCCGC 

ATGACATTAA 

GATGACGGTG 

GCGGATGCCG 

GGCTGGCTTA 



GAAGCTCCCT 

TTCTCCCTTC 

TGTAGGTCGT 

GCGCCTTATC 

TGGCAGCAGC 

TCTTGAAGTG 

TGCTGAAGCC 

CCGCTGGTAG 

CTCAAGAAGA 

GTTAAGGGAT 

AAAAATGAAG 

AATGCTTAAT 

CCTGACTCCC 

CTGCAATGAT 

CAGCCGGAAG 

TTAATTGTTG 

TTGCCATTGC 

CCGGTTCCCA 

GCTCCTTCGG 

TTATGGCAGC 

CTGGTGAGTA 

GCCCGGCGTC 

TTGGAAAACG 

CGATGTAACC 

CTGGGTGAGC 

AATGTTGAAT 

GTCTCATGAG 

GCACATTTCC 

CCTATAAAAA 

AAAACCTCTG 

GGAGCAGACA 

ACTATGCGGC 
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CATATGCGGT 
TCGCCATTCA 
CGCCAGCTGG 
TCCCAGTCAC 
GACAGGATCT 
CTAGAATACG 
GAAAGACCCC 
GAAAAATACC 
AACGTTGGGC 
ACAGATGGTC 
GGCCCAACCC 
TGAAATGACC 
GCGCTTCTGC 
CTCCGATAGA 
CCGACTCGTG 
CGGGGGTCTT 
ACCCACCACC 
ACTTCCCTGA 
AGGCTCTCTA 
AACTGGACCG 
GACACCAGAC 
CCCCCACCGC 
CTGCCGACCC 
ACCCCTTAAA 
CAGAGGGGTC 
GGAGGTAACC 
GACCTGGTGG 
TTGGGGGCTA 
GGGCAGCAGC 
GTGCAACACT 
ATTTTATGTT 
CTCCTTCTAC 
CTCATCATGG 
ATGCAAAGAT 
GGTTACTTCC 
TCCAGGGCTT 
AGGGCCAAAC 
GCCTTCAGTC 
GGGAACGGAA 
CAGTCCTGAC 
AGGGGTTGCC 
GGCCTCCCAA 
AGTTCCCAAA 
TTATCTAGTT 
CTCCACCACC 
AGTCACCTAT 
AAGAGAACCG 
TGCCGCTGGA 
CCAAGCCGCA 
GTCTCTCACT 
TCTAAAAGAA 
CACAGGACTA 
ACTGTTTGAG 
CACCTTGATA 
ACCCTGCATT 
TTTAGTCCTG 
CTAGTGTTGA 
GGAGGGCCAC 
CCGGAGCGGT 
ACTTCGCCGG 
TGGTGCCGGA 
AGTGGTCGGA 
TCGGCGAGCA 
ACTTCGTGGC 
GCAGCTTATA 
TTTTCACTGC 
ATCCAGATCT 
AAAGGCCAGG 



GTGAAATACC 
GGCTGCGCAA 
CGAAAGGGGG 
GACGTTGTAA 
CAGTAGTCCA 
AGCCACAATA 
ACCAAATTGC 
AAACCAAGAA 
CAAACAGGAT 
ACCGCGGTTC 
TCAGCAGTTT 
CTGTGCCTTA 
TTCCCGAGCT 
CTGAGTCGCC 
GTCTCGCTGT 
TCATTTGGGG 
GGGAGGTAAG 
CCCTGACATG 
CTTAGTCCAG 
ACCGGTGGTA 
TAAGAACCTA 
CCTCAAAGTA 
CGGGGGTGGA 
AATAAGGTTA 
AGTACTGCTT 
AATGGAGATC 
CCTGACCTTA 
GAATATCAAT 
CCAGGCTGTT 
GCCTGGAACA 
TGCCCCGGGC 
TGTGCCTATT 
GATTTCATCA 
AATAAGTGGT 
TGGACCACAG 
ACATTTGGGA 
CCCGTTCTGG 
ACCAAACCAC 
AATAGGCTGC 
AAAACCCAAG 
GTCCTGGGTA 
CACAAGTTGA 
ACACATCAGG 
GCCCCTACAG 
ATACTGAACC 
CATTCCCCCA 
GTGTCGTTAA 
ATAGGAACAG 
GTACAGGATG 
TCCCTGTCTG 
GGAGGGCTGT 
GTGAGAGACA 
TCAACTCAAG 
TCTACCATTA 
CTTAATCGAT 
ACTCAACAAT 
CAATTAATCA 
CATGGCCAAG 
CGAGTTCTGG 
TGTGGTCCGG 
CAACACCCTG 
GGTCGTGTCC 
GCCGTGGGGG 
CGAGGAGCAG 
ATGGTTACAA 
ATTCTAGTTG 
GGGCCCATGC 
AACCGTAAAA 



GCACAGATGC 
CTGTTGGGAA 
ATGTGCTGCA 
AACGACGGCC 
GGCTTTAGTC 
AATAAAAGAT 
TTAGCCTGAT 
TAGAGAAGTT 
ATCTGCGGTG 
GGCCCCGGCC 
CTTAAGACCC 
TTTGAATTAA 
CTATAAAAGA 
CGGGTACCCG 
TCCTTGGGAG 
GCTCGTCCGG 
CTGGCCAAGA 
ACAAGAGTTA 
CACGAAGTCT 
CCTCACCCTT 
GAACCTCGCT 
GACGGCATCG 
CCATCCTCTA 
ACCCGCGAGG 
CGCCCGGCTC 
GGGAGACGGT 
CCCCAGATTT 
CCCCTTTTTC 
CCAGAGACTG 
GACTCAAGCT 
CCCACCGCCC 
GGGGCTGTGA 
CAGTAAACAA 
GCAACCCCTT 
GACATTACTG 
TCCGACTCAG 
CAGACCAACA 
CCAGTGGGAC 
TAAACTTAGT 
AGTGCTGGTT 
CCTACTCCAA 
CCCTGTCCGA 
CCCTATGTAA 
GTACCATGTG 
TTACCACTGA 
GCTATGTTTA 
CCCTGGCCCT 
GGACTACTGC 
ATCTCAGGGA 
AAGTTGTCCT 
GTGCTGCTCT 
GCATGGCCAA 
GATGGTTTGA 
TGGGACCCCT 
TAGTTCAATT 
ACCACCAGCT 
TCGGCATAGT 
TTGACCAGTG 
ACCGACCGGC 
GACGACGTGA 
GCCTGGGTGT 
ACGAACTTCC 
CGGGAGTTCG 
GACTGANNNN 
ATAAAGCAAT 
TGGTTTGTCC 
GGCCGCGGAT 
AGGCCGCGTT 



GTAAGGAGAA 
GGGCGATCGG 
AGGCGATTAA 
AGTGAATTCC 
CTGACTCAAC 
TTTATTTAGT 
AGCCGCAGTA 
CAGATCAAGG 
AGCAGTTTCG 
CGGGGCCAAG 
ATCAGATGTT 
GCAATCAGCC 
GCTCACAACC 
TGTATCCAAT 
GGTCTCCTCA 
GATCTGGAGA 
TCTTATATGG 
CTAACAGCCC 
GGAGACCTCT 
ACCGAGTCGG 
GGAAAGGACC 
CAGCTTGGAT 
GACTGAGATG 
CCCCCTAATC 
CAGTCCTCAT 
ATGGGCAACT 
ATGTATGTTA 
TTCTCCCCCG 
CGAAGAACCT 
AGACCAGACA 
CCGAGAATCC 
GACAACCGGT 
CAATCTCACC 
AGTTATTCGG 
GGGCTTACGT 
ATACCAAAAT 
GCCACTCTCC 
TCCTCTCTCC 
AGACGGAGCC 
GTGTCTAGTA 
CCATACCTCT 
AGTGACCGGA 
TACCACCCAG 
GGCTTGTAGT 
TTATTGTGTT 
CGGCCTGTTT 
ATTATTGGGT 
TCTAATGGCC 
GGTTGAAAAA 
ACAGAATCGA 
AAAAGAAGAA 
ATTGAGAGAG 
GGGACTGTTT 
CATTGTACTC 
TGTTAAAGAC 
AAAGCCTATA 
ATACGGCATA 
CCGTTCCGGT 
TCGGGTTCTC 
CCCTGTTCAT 
GGGTGCGCGG 
GGGACGCCTC 
CCCTGCGCGA 
CGGACCGGTC 
AGCATCACAA 
AAACTCATCA 
CGATNNNNAC 
GCTGGCGTTT 



AATACCGCAT 
TGCGGGCCTC 
GTTGGGTAAC 
GATTAGTTCA 
AATACCACCA 
TTCCAGAAAA 
ACGCCATTTT 
GCGGGTACAC 
GCCCCGGCCC 
AACAGATGGT 
TCCAGGCTCC 
TGCTTCTCGC 
CCTCACTCGG 
AAATCCTCTT 
GAGTGATTGA 
CCCCTGCCCA 
GGCACCCCCG 
CTCTCTCCAA 
GGCGGCAGCC 
CGACACAGTG 
TTACACAGTC 
ACACGCCGCC 
GCGCGTTCAA 
CCCTTAATTC 
CAAGTCTATA 
TCTGGCAACC 
GCCCACCATG 
GGGCCCCCTT 
TTAACCTCCC 
ACTCATAAAT 
AAGTCATGTG 
AGAGCTTACT 
TCTGACCAGG 
TTTACAGACG 
TTGTATGTCT 
CTAGGACCCC 
AAGCCCAAAC 
CCTACCCAAC 
TACCAAGCCC 
GCGGGACCCC 
GCTCCAGCCA 
CAGGGACTCT 
ACAAGCAGTC 
ACCGGGCTTA 
CTTGTCGAAC 
GAGAGATCCA 
GGACTAACCA 
ACTCAGCAAT 
TCAATCTCTA 
AGGGGCCTAG 
TGTTGCTTCT 
AGGCTTAATC 
AACAGATCCC 
CTAATGATTT 
AGGATCTCAG 
GAGTACGAGC 
GTATAATACG 
GCTCACCGCG 
CCGGGACTTC 
CAGCGCGGTC 
CCTGGACGAG 
CGGGCCGGCC 
CCCGGCCGGC 
GACTTGTTAA 
ATTTCACAAA 
ATGTATCTTA 
ATGTGAGCAA 
TTCCATAGGC 



CAGGCGCCAT 
TTCGCTATTA 
GCCAGGGTTT 
ATTTGTTAAA 
GCTAAAACCA 
AGGGGGGAAT 
GCAAGGCATG 
GAAAACAGCT 
GGGGCCAAGA 
CCCCAGATAT 
CCCAAGGACC 
TTCTGTTCGC 
CGCGCCAGTC 
GCTGTTGCAT 
CTACCCGTCT 
GGGACCACCG 
CCCCTTGTAA 
GCTCACTTAC 
TACCAAGAAC 
TGGGTCCGCC 
CTGCTGACCA 
CACGTGAAGG 
CGCTCTCAAA 
TTCTGATGCT 
ATATCACCTG 
ACCCTCTGTG 
GACCATCTTA 
GTTGCTCAGG 
TCACCCCTCG 
CAAATGAGGG 
GGGGTCCAGA 
GGAAGCCCTC 
CTGTCCAGGT 
CCGGGAGACG 
CCGGACAAGA 
GCGTCCCAAT 
CTGTTAAGTC 
TTCCACCGGC 
TCAACCTCAC 
CCTACTACGA 
ACTGCTCCGT 
GCATAGGAGC 
GAGGGTCCTA 
CTCCATGCAT 
TCTGGCCAAG 
ACCGACACAA 
TGGGGGGAAT 
TCCAGCAGCT 
ACCTAGAAAA 
ACTTGTTATT 
ATGCGGACCA 
AGAGACAGAA 
CTTGGTTTAC 
TGCTCTTCGG 
TAGTCCAGGC 
CATAGGGCGC 
ACTCACTATA 
CGCGACGTCG 
GTGGAGGACG 
CAGGACCAGG 
CTGTACGCCG 
ATGACCGAGA 
AACTGCGTGC 
CTTGTTTATT 
TAAAGCATTT 
TCATGTCTGG 
AAGGCCAGCA 
TCCGCCCCCC 
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Figure 10. FBdelPMOSAF Sequence 2 

TGACC-AGCAT CACAAAAATC GACGCTCAAG TCAGAGGTGG CGAAACCCGA CAGGACTATA - 4140 

AAGATACCAG GCGTTTCCCC CTGGAAGCTC CCTCGTGCGC TCTCCTGTTC CGACCCTGCC 470 0 

GCTTACCGGA TACCTGTCCG CCTTTCTCCC TTCGGGAAGC GTGGCGCTTT CTCAATGCTC 47fio 

ACGCTGTAGG TATCTCAGTT CGGTGTAGGT CGTTCGCTCC AAGCTGGGCT GTGTGCACGA Ho n 

ACCCCCCGTT CAGCCCGACC GCTGCGCCTT ATCCGGTAAC TATCGTCTTG AGTCCAACCC linn 

GGTAAGACAC GACTTATCGC CACTGGCAGC AGCCACTGGT AACAGGATTA GCAGAGCGAG 4440 

GTATGTAGGC GGTGCTACAG AGTTCTTGAA GTGGTGGCCT AACTACGGCT ACACTAGAAG ' 4500 

GACAGTATTT GGTATCTGCG CTCTGCTGAA GCCAGTTACC TTCGGAAAAA GAGTTGGTAG 4560 

CTCTTGATCC GGCAAACAAA CCACCGCTGG TAGCGGTGGT TTTTTTGTTT GCAAGCAGCA 4620 

GATTACGCGC AGAAAAAAAG GATCTCAAGA AGATCCTTTG ATCTTTTCTA CGGGGTCTGA 4680 

CGCTCAGTGG AACGAAAACT CACGTTAAGG GATTTTGGTC ATGAGATTAT CAAAAAGGAT 4740 

CTTCACCTAG ATCCTTTTAA ATTAAAAATG AAGTTTTAAA TCAATCTAAA GTATATATGA 4800 

GTAAACTTGG TCTGACAGTT ACCAATGCTT AATCAGTGAG GCACCTATCT CAGCGATCTG 4860 

TCTATTTCGT TCATCCATAG TTGCCTGACT CCCCGTCGTG TAGATAACTA CGATACGGGA 4920 

GGGCTTACCA TCTGGCCCCA GTGCTGCAAT GATACCGCGA GACCCACGCT CACCGGCTCC 4980 

AGATTTATCA GCAATAAACC AGCCAGCCGG AAGGGCCGAG CGCAGAAGTG GTCCTGCAAC 5040 

TTTATCCGCC TCCATCCAGT CTATTAATTG TTGCCGGGAA GCTAGAGTAA GTAGTTCGCC 5100 

AGTTAATAGT TTGCGCAACG TTGTTGCCAT TGCTACAGGC ATCGTGGTGT CACGCTCGTC 5160 

GTTTGGTATG GCTTCATTCA GCTCCGGTTC CCAACGATCA AGGCGAGTTA CATGATCCCC 5220 

CATGTTGTGC AAAAAAGCGG TTAGCTCCTT CGGTCCTCCG ATCGTTGTCA GAAGTAAGTT 5280 

GGCCGCAGTG TTATCACTCA TGGTTATGGC AGCACTGCAT AATTCTCTTA CTGTCATGCC 5340 

ATCCGTAAGA TGCTTTTCTG TGACTGGTGA GTACTCAACC AAGTCATTCT GAGAATAGTG 5400 

TATGCGGCGA CCGAGTTGCT CTTGCCCGGC GTCAATACGG GATAATACCG CGCCACATAG 5460 

CAGAACTTTA AAAGTGCTCA TCATTGGAAA ACGTTCTTCG GGGCGAAAAC TCTCAAGGAT 5590 

CTTACCGCTG TTGAGATCCA GTTCGATGTA ACCCACTCGT GCACCCAACT GATCTTCAGC 5580 

ATCTTTTACT TTCACCAGCG TTTCTGGGTG AGCAAAAACA GGAAGGCAAA ATGCCGCAAA 5640 

AAAGGGAATA AGGGCGACAC GGAAATGTTG AATACTCATA CTCTTCCTTT TTCAATATTA 5700 

TTGAAGCATT TATCAGGGTT ATTGTCTCAT GAGCGGATAC ATATTTGAAT GTATTTAGAA 5760 

AAATAAACAA ATAGGGGTTC CGCGCACATT TCCCCGAAAA GTGCCACCTG ACGTCTAAGA 5820 

AACCATTATT ATCATGACAT TAACCTATAA AAATAGGCGT ATCACGAGGC CCTTTCGTCT 5880 

CGCGCGTTTC GGTGATGACG GTGAAAACCT CTGACACATG CAGCTCCCGG AGACGGTCAC 5940 

AGCTTGTCTG TAAGCGGATG CCGGGAGCAG ACAAGCCCGT CAGGGCGCGT CAGCGGGTGT 6000 

TGGCGGGTGT CGGGGCTGGC TTAACTATGC GGCATCAGAG CAGATTGTAC TGAGAGTGCA 6060 

6061 



C 
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CATATGCGGT 
TCGCCATTCA 
CGCCAGCTGG 
TCCCAGTCAC 
GACAGGATCT 
CTAGAATACG 
GAAAGACCCC 
GAAAAATACC 
AACGTTGGGC 
ACAGATGGTC 
GGCCCAACCC 
TGAAATGACC 
GCGCTTCTGC 
CTCCGATAGA 
CCGACTCGTG 
CGGGGGTCTT 
ACCCACCACC 
CGGCCTTTGT 
TACATTGTGC 
AAGAGACCTT 
CCTTAGCGCT 
TTCTCTATGG 
GATTTCTCCC 
GGGACCAGAT 
TCGGGGATCA 
GCCCATACCT 
GGGTCCATGC 
TGGAAAAGAC 
AATAAGAACC 
GTTGTCTGGG 
GATGTATGTG 
TCCTCTAAAC 
TGGGGAGCCA 
TACGTATGTC 
TCCCTATACT 
TCCTCAAAAG 
CAACAGTGTC 
AAATTATCCA 
CATCCAGGCG 
GGTCCTGACC 
CCTCTTCCCC 
GCGACTAGTG 
CCCACCACAG 
ACCAACCCAG 
GAAGCAATAG 
GGGACCCAAG 
GTGCCCTTTA 
CATCAGTATC 
TGCCTCTCCA 
CCTCGCATCT 
AGGACTAAAA 
GCGGGAATAG 
CTGACAAGCC 
AAGTTAGAGG 
GACTTGCTGT 
TACATAGACC 
AAAAGACAGT 
CCTTGGTTCA 
CTCATCCTCG 
GTAGTCCAGG 
CCATAGGGCG 
GACTCACTAT 
GCGCGACGTC 
CGTGGAGGAC 
CCAGGACCAG 
GCTGTACGCC 
CATGACCGAG 
CAACTGCGTG 



GTGAAATACC 
GGCTGCGCAA 
CGAAAGGGGG 
GACGTTGTAA 
CAGTAGTCCA 
AGCCACAATA 
ACCAAATTGC 
AAACCAAGAA 
CAAACAGGAT 
ACCGCGGTTC 
TCAGCAGTTT 
CTGTGCCTTA 
TTCCCGAGCT 
CTGAGTCGCC 
GTCTCGCTGT 
TCATTTGGGG 
GGGAGGTAAG 
TGCTCAGGTA 
GTATAGACCC 
GACCAAATTA 
GCTTAGGGCC 
AGGACCACCC 
TGTCTTATTT 
CAAAGAGGTG 
AGTGCTTGTC 
GGTGTTGCTG 
TTCTCACCTC 
TGATCATCCT 
CCCACCAGCC 
ATACAAAGGC 
CCTTGGCGGC 
GAGTCAGACC 
TAGGGTGCAG 
CCCGGGATGG 
GTAAAGAATG 
ACCTCATAAC 
ACCAGACCGG 
AGGACTGGAT 
TACAGTTCAC 
TCGTCCTTGT 
CAAGGGAAGC 
CACAAACTCC 
GCGACAGACT 
GGGCCACTGA 
CCTCATCAGG 
GAAAGCTCAC 
CCCATCAGCA 
TGCTCCCCTC 
CCTCAGTTTT 
ATTACTATCC 
GAGAGGCTGT 
GTACTGGTTC 
TCCAGATCGC 
ACTCACTGAC 
TTCTAAAAGA 
ACTCAGGTGC 
TAGAGCGCCA 
CTACCCTGCT 
GGCCATGCAT 
CTTTAGTCCT 
CCTAGTGTTG 
AGGAGGGCCA 
GCCGGAGCGG 
GACTTCGCCG 
GTGGTGCCGG 
GAGTGGTCGG 
ATCGGCGAGC 
CACTTCGTGG 



GCACAGATGC 
CTGTTGGGAA 
ATGTGCTGCA 
AACGACGGCC 
GGCTTTAGTC 
AATAAAAGAT 
TTAGCCTGAT 
TAGAGAAGTT 
ATCTGCGGTG 
GGCCCCGGCC 
CTTAAGACCC 
TTTGAATTAA 
CTATAAAAGA 
CGGGTACCCG 
TCCTTGGGAG 
GCTCGTCCGG 
CTGGCCAAGA 
AGTCAGGGAC 
CAGAGCTCAG 
GCCTTAGAGA 
AGGAATACCC 
CCCATACTTG 
ACTCACTTAA 
TATAAGCCTG 
AGACGCCATC 
ACTACCCCGA 
AAACCTGCAC 
CTTAAGCTGC 
CATGACCCTC 
AGTCCAGCCC 
TAGTCTTGAG 
TCCGGACTCA 
CTACCCTCGG 
CCGGACCCTT 
GGATTGTGAG 
TGTAAAATGG 
CTGGTGTAAC 
AACGGGAAAA 
CATTCGCTTA 
GGAACAAGGA 
GCCACCGCCA 
CACGGTGAGA 
TTTTGATCTT 
GTCTTGCTGG 
AGAGGTCGCC 
CCTCACTGAG 
TCTCTGCAAT 
CAACCATAGC 
TAATCAGACT 
TGAAGAAGTT 
CTCACTTACC 
AACTGCCTTA 
CATAGATGCT 
TTCCCTGTCC 
AGGTGGCCTC 
AGTACGGGAC 
GAAAAGCCAA 
ATCAACCATC 
CATCAATCGA 
GACTCAACAA 
ACAATTAATC 
CCATGGCCAA 
TCGAGTTCTG 
GTGTGGTCCG 
ACAACACCCT 
AGGTCGTGTC 
AGCCGTGGGG 
CCGAGGAGCA 



GTAAGGAGAA 
GGGCGATCGG 
AGGCGATTAA 
AGTGAATTCC 
CTGACTCAAC 
TTTATTTAGT 
AGCCGCAGTA 
CAGATCAAGG 
AGCAGTTTCG 
CGGGGCCAAG 
ATCAGATGTT 
CCAATCAGCC 
GCTCACAACC 
TGTATCCAAT 
GGTCTCCTCA 
GATCTGGAGA 
TCCCTAAGGT 
TGGCCACTCA 
GTCAGGTAGA 
CCGGTGGAAA 
CTGGCCGGTT 
AGTCTGGAGA 
AGGCTTTAGA 
GTACCGTAAC 
GACCCAGCAG 
CCGCGGTAAA 
CACCTTCGGC 
GTATTCGGCG 
ACTTGGCAGG 
CCTTGGACTT 
TCCTGGGATA 
GACTATACTG 
GCTAGGACTA 
TCAGAAGCTA 
ACCACGGGGA 
GACCAAAATA 
CCCCTTAAAA 
ACCTGGGGAT 
AAAATCACCA 
CCTCCTAGAA 
TCTCTCCCCG 
AAAACAATTG 
GTGCAGGGGG 
CTTTGTTTGG 
TACTCCACCG 
GTCTCAGGAC 
CAGACCCTAT 
TGGTGGGCTT 
AGAGATTTCT 
TTGTTACAGG 
CTAGCTGTTT 
ATTAAAGGAC 
GACCTCCGGG 
GAGGTAGTGC 
TGTGCGGCCC 
TCCATGAAAA 
AACTGGTATG 
GCTGGGCCCC 
TTAGTTCAAT 
TACCACCAGC 
ATCGGCATAG 
GTTGACCAGT 
GACCGACCGG 
GGACGACGTG 
GGCCTGGGTG 
CACGAACTTC 
GCGGGAGTTC 
GGACTGANNN 



AATACCGCAT 
TGCGGGCCTC 
GTTGGGTAAC 
GATTAGTTCA 
AATACCACCA 
TTCCAGAAAA 
ACGCCATTTT 
GCGGGTACAC 
GCCCCGGCCC 
AACAGATGGT 
TCCAGGCTCC 
TGCTTCTCGC 
CCTCACTCGG 
AAATCCTCTT 
GAGTGATTGA 
CCCCTGCCCA 
ACTCGGGTCA 
ACTGGGGATA 
AAGAATGAAC 
AGACTGGGTG 
TGGTTTAACT 
AACTTTGGGT 
AATTGTAAGG 
AATCCCTCAC 
CCTTGAGCCT 
AGTCGATGGT 
ACCAGATGAG 
GCGGCGGGAC 
TACTGTCCCA 
GGTGGCCCAC 
TCCCGGGAAC 
CCGCTTATAA 
GAATGGCAAG 
GAAGGTGCGG 
CCGGTTATTG 
GCGAATGGAC 
TAGATTTCAC 
TAAGATTCTA 
ACATGCCAGC 
CGTCCCTCGC 
ACTCTAACTC 
TTACCCTAAA 
CCTTCCTAAC 
CCATGGGCCC 
ACCTTGACCG 
ACGGGTTGTG 
CCATCAATTC 
GCAGCACTGG 
GTATCCAGGT 
CCTATGACAA 
TACTGGGGTT 
CTATAGACCT 
CCCTCCAAGA 
TCCAAAATAG 
TAAAGGAAGA 
AACTCAAAGA 
AAGGATGGTT 
TATTACTCCT 
TTGTTAAAGA 
TAAAGCCTAT 
TATACGGCAT 
GCCGTTCCGG 
CTCGGGTTCT 
ACCCTGTTCA 
TGGGTGCGCG 
CGGGACGCCT 
GCCCTGCGCG 
NCGGACCGGT 



CAGGCGCCAT 
TTCGCTATTA 
GCCAGGGTTT 
ATTTGTTAAA 
GCTAAAACCA 
AGGGGGGAAT 
GCAAGGCATG 
GAAAACAGCT 
GGGGCCAAGA 
CCCCAGATAT 
CCCAAGGACC 
TTCTGTTCGC 
CGCGCCAGTC 
GCTGTTGCAT 
CTACCCGTCT 
GGGACCACCG 
GACAATGGCC 
AATTGGAAGT 
AGAACAATTA 
ACCCTCCTTC 
CCTTATGAAA 
CCCGATGATA 
ACCCAAATCT 
CCGTTCCAGG 
CGGTGGAAAG 
ATTGCTGCCT 
TCCTGGGAGC 
GAGTCTGCAA 
AACTGGAGAC 
ACTTAAACCT 
CGATGTCTCG 
GCAAATCACC 
CTCTACCTTC 
GGGGCTAGAA 
GCTATCTAAA 
TCAAAAATTT 
AGACAAAGGA 
TGTGTCTGGA 
TGTGGCAGTA 
TCTCCCACCT 
CACAGCCCTG 
CACTCCGCCT 
CTTAAATGCT 
CCCTTATTAT 
GTGCCGCTGG 
CATAGGAAAG 
CTCCGGAGAC 
CCTCACCCCT 
CCAGCTGATT 
TTCTCACCCC 
GGGAATCACG 
CCAGCAAGGC 
CTCAGTCAGC 
GAGAGGCCTT 
GTGCTGTTTT 
AAAACTGGAT 
CAATAACTCC 
CCTTCTGTTG 
CAGGATCTCA 
AGAGTACGAG 
AGTATAATAC 
TGCTCACCGC 
CCCGGGACTT 
TCAGCGCGGT 
GCCTGGACGA 
CCGGGCCGGC 
ACCCGGCCGG 
CGACTTGTTA 
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Figure 11. FBdelPGASAF Sequence 2 

ACTTGTTTAT TGCAGCTTAT AATGGTTACA AATAAAGCAA TAGCATCACA AATTTCACAA 4140 
ATAAAGCATT TTTTTCACTG CATTCTAGTT GTGGTTTGTC CAAACTCATC AATGTATCTT * 4200 

ATCATGTCTG GATCCAGATC TGGGCCCATG CGGCCGCGGA TCGATNNNNA CATGTGAGC^ 42 60 

AAAGGCCAGC AAAAGGCCAG GAACCGTAAA AAGGCCGCGT TGCTGGCGTT TTTCCATAGG 4320 

CTCCGCCCCC CTGACGAGCA TCACAAAAAT CGACGCTCAA GTCAGAGGTG GCGAAACCCG 4380 

ACAGGACTAT AAAGATACCA GGCGTTTCCC CCTGGAAGCT CCCTCGTGCG CTCTCCTGTT 4440 

CCGACCCTGC CGCTTACCGG ATACCTGTCC GCCTTTCTCC CTTCGGGAAG CGTGGCGCTT 4500 

7CTCAATGCT CACGCTGTAG GTATCTCAGT TCGGTGTAGG TCGTTCGCTC CAAGCTGGGC 4560 

TGTGTGCACG AACCCCCCGT TCAGCCCGAC CGCTGCGCCT TATCCGGTAA CTATCGTCTT 4620 

GAGTCCAACC CGGTAAGACA CGACTTATCG CCACTGGCAG CAGCCACTGG TAACAGGATT 4680 

AGCAGAGCGA GGTATGTAGG CGGTGCTACA GAGTTCTTGA AGTGGTGGCC TAACTA.CGGC 4740 

TACACTAGAA GGACAGTATT TGGTATCTGC GCTCTGCTGA AGCCAGTTAC CTTCGGAAAA 4800 

AGAGTTGGTA GCTCTTGATC CGGCAAACAA ACCACCGCTG GTAGCGGTGG TTTTTTTGT^ 4860 

TGCAAGCAGC AGATTACGCG CAGAAAAAAA GGATCTCAAG AAGATCCTTT GATCTTTTCT 4920 

ACGGGGTCTG ACGCTCAGTG GAACGAAAAC TCACGTTAAG GGATTTTGGT CATGAGATTA 4980 

TCAAAAAGGA TCTTCACCTA GATCCTTTTA AATTAAAAAT GAAGTTTTAA ATCAATCTAA 5040 

AGTATATATG AGTAAACTTG GTCTGACAGT TACCAATGCT TAATCAGTGA GGCACCTATC 5100 

TCAGCGATCT GTCTATTTCG TTCATCCATA GTTGCCTGAC TCCCCGTCGT GTAGATAACT 5160 

ACGATACGGG AGGGCTTACC ATCTGGCCCC AGTGCTGCAA TGATACCGCG AGACCCACGC 5220 

TCACCGGCTC CAGATTTATC AGCAATAAAC CAGCCAGCCG GAAGGGCCGA GCGCAGAAGT 5280 

GGTCCTGCAA CTTTATCCGC CTCCATCCAG TCTATTAATT GTTGCCGGGA AGCTAGAGTA 5340 

AGTAGTTCGC CAGTTAATAG TTTGCGCAAC GTTGTTGCCA TTGCTACAGG CATCGTGGTG 5400 

TCACGCTCGT CGTTTGGTAT GGCTTCATTC AGCTCCGGTT CCCAACGATC AAGGCGAGTT 5460 

ACATGATCCC CCATGTTGTG CAAAAAAGCG GTTAGCTCCT TCGGTCCTCC GATCGTTGTC 5520 

AGAAGTAAGT TGGCCGCAGT GTTATCACTC ATGGTTATGG CAGCACTGCA TAATTCTCTT 5580 

ACTGTCATGC CATCCGTAAG ATGCTTTTCT GTGACTGGTG AGTACTCAAC CAAGTCATTC 5640 

TGAGAATAGT GTATGCGGCG ACCGAGTTGC TCTTGCCCGG CGTCAATACG GGATAATACC 5700 

GCGCCACATA GCAGAACTTT AAAAGTGCTC ATCATTGGAA AACGTTCTTC GGGGCGAAAA 5760 

CTCTCAAGGA TCTTACCGCT GTTGAGATCC AGTTCGATGT AACCCACTCG TGCACCCAAC 5820 

TGATCTTCAG CATCTTTTAC TTTCACCAGC GTTTCTGGGT GAGCAAAAAC AGGAAGGCAA 5880 

AATGCCGCAA AAAAGGGAAT AAGGGCGACA CGGAAATGTT GAATACTCAT ACTCTTCCTT 5940 

TTTCAATATT ATTGAAGCAT TTATCAGGGT TATTGTCTCA TGAGCGGATA CATATTTGAA 6000 

TGTATTTAGA AAAATAAACA AATAGGGGTT CCGCGCACAT TTCCCCGAAA AGTGCCACCT 6060 

GACGTCTAAG AAACCATTAT TATCATGACA TTAACCTATA AAAATAGGCG TATCACGAGG 6120 

CCCTTTCGTC TCGCGCGTTT CGGTGATGAC GGTGAAAACC TCTGACACAT GCAGCTCCCG 6180 " 

GAGACGGTCA CAGCTTGTCT GTAAGCGGAT GCCGGGAGCA GACAAGCCCG TCAGGGCGCG 6240 

TCAGCGGGTG TTGGCGGGTG TCGGGGCTGG CTTAACTATG CGGCATCAGA GCAGATTGTA 6300 

CTGAGAGTGC AC 53^2 
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CATATGCGGT GTGAAATACC GCACAGATGC GTAAGGAGAA AATACCGCAT CAGGCGCCAT 60 
TCGCCATTCA GGCTGCGCAA CTGTTGGGAA GGGCGATCGG TGCGGGCCTC TTCGCTATTA " 120 

CGCCAGCTGG CGAAAGGGGG ATGTGCTGCA AGGCGATTAA GTTGGGTAAC GCCAGGGTTT 180 

TCCCAGTCAC GACGTTGTAA AACGACGGCC AGTGAATTCC GATTAGTTCA ATTTGTTAAA 240 

GACAGGATCT CAGTAGTCCA GGCTTTAGTC CTGACTCAAC AATACCACCA GCTAAAACCA 300 

CTAGAATACG AGCCACAATA AATAAAAGAT TTTATTTAGT TTCCAGAAAA AGGGGGGAAT 360 

GAAAGACCCC ACCAAATTGC TTAGCCTGAT AGCCGCAGTA ACGCCATTTT GCAAGGCATG 420 

GAAAAATACC AAACCAAGAA TAGAGAAGTT CAGATCAAGG GCGGGTACAC GAAAACAGCT 480 

AACGTTGGGC CAAACAGGAT ATCTGCGGTG AGCAGTTTCG GCCCCGGCCC GGGGCCAAGA 540 

ACAGATGGTC ACCGCGGTTC GGCCCCGGCC CGGGGCCAAG AACAGATGGT CCCCAGATAT 600 

GGCCCAACCC TCAGCAGTTT CTTAAGACCC ATCAGATGTT TCCAGGCTCC CCCAAGGACC 660 

TGAAATGACC CTGTGCCTTA TTTGAATTAA CCAATCAGCC TGCTTCTCGC TTCTGTTCGC 720 

GCGCTTCTGC TTCCCGAGCT CTATAAAAGA GCTCACAACC CCTCACTCGG CGCGCCAGTC 780 

CTCCGATAGA CTGAGTCGCC CGGGTACCCG TGTATCCAAT AAATCCTCTT GCTGTTGCAT 840 

CCGACTCGTG GTCTCGCTGT TCCTTGGGAG GGTCTCCTCA GAGTGATTGA CTACCCGTCT 900 

CGGGGGTCTT TCATTTGGGG GCTCGTCCGG GATCTGGAGA CCCCTGCCCA GGGACCACCG 960 

ACCCACCACC GGGAGGTAAG CTGGCCAAGA TCCCCCGGGC TGCAGGAATT TATGAAATCC 1020 

TTTATGGGGG ACCCCCCCCT TTGTCAACCT TGCTCAATTC CTTCTCCCCC TCCGATCCTA 1080 

AGACTGATTT ACAAGCCCGA CTAAAAGGGC TGCAAGGCGT GCAGGCCCAA ATCTGGACAC 1140 

CCCTGGCCGA ATTGTACCGG CCAGGACATC CACAAACTAG CCACCCATTT CAGGTGGGAG 1200 

ACTCCGTGTA CGTCCGGCGG CACCGCTCTC AAGGATTGGA GCCTCGTTGG AAGGGACCTT 1260 

ACATCGTCCT GCTGACCACG CCCACCGCCA TAAAGGTTGA CGGGATCGCC GCCTGGATTC 1320 

ACGCATCGCA CGCCAAGGCA GCCCCAAAAA CCCCTGGACC AGAAACTCCC AAAACCTGGA 13 80 

AGCTCCGCCG TTCGGAGAAC CCTCTTAAGA TAAGACTCTC CCGTGTCTGA CTGCTAATCC 1440 

ACCTTGTCCC TGTACTAACC CAAAATGAAA CTCCCAACAG GAATGGTCAT TTTATGTAGC 1500 

CTAATAATAG TTCGGGCAGG GTTTGACGAC CCCCGCAAGG CTATCGCATT AGTACAAAAA 1560 

CAACATGGTA AACCATGCGA ATGCAGCGGA GGGCAGGTAT CCGAGGCCCC ACCGAACTCC 1620 

ATCCAACAGG TAACTTGCCC AGGCAAGACG GCCTACTTAA TGACCAACCA AAAATGGAAA 1680 

TGCAGAGTCA CTCCAAAAAT CTCACCTAGC GGGGGAGAAC TCCAGAACTG CCCCTGTAAC 1740 

ACTTTCCAGG ACTCGATGCA CAGTTCTTGT TATACTGAAT ACCGGCAATG CAGGCGAATT 1800 

AATAAGACAT ACTACACGGC CACCTTGCTT AAAATACGGT CTGGGAGCCT CAACGAGGTA 1860 

CAGATATTAC AAAACCCCAA TCAGCTCCTA CAGTCCCCTT GTAGGGGCTC TATAAATCAG 1920 

CCCGTTTGCT GGAGTGCCAC AGCCCCCATC CATATCTCCG ATGGTGGAGG ACCCCTCGAT 1980 

ACTAAGAGAG TGTGGACAGT CCAAAAAAGG CTAGAACAAA TTCATAAGGC TATGACTCCT 2040 

GAACTTCAAT ACCACCCCTT AGCCCTGCCC AAAGTCAGAG ATGACCTTAG CCTTGATGCA 2100 . 

CGGACTTTTG ATATCCTGAA TACCACTTTT AGGTTACTCC AGATGTCCAA TTTTAGCCTT 2160 

GCCCAAGATT GTTGGCTCTG TTTAAAACTA GGTACCCCTA CCCCTCTTGC GATACCCACT 2220 

CCCTCTTTAA CCTACTCCCT AGCAGACTCC CTAGCGAATG CCTCCTGTCA GATTATACCT 2280 

CCCCTCTTGG TTCAACCGAT GCAGTTCTCC AACTCGTCCT GTTTATCTTC CCCTTTCATT 2340 

AACGATACGG AACAAATAGA CTTAGGTGCA GTCACCTTTA CTAACTGCAC CTCTGTAGCC 2400 
AATGTCAGTA GTCCTTTATG TGCCCTAAAC GGGTCAGTCT TCCTCTGTGG AAATAACATG ■ 2460 

GCATACACCT ATTTACCCCA AAACTGGACC AGACTTTGCG TCCAAGCCTC CCTCCTCCCC 2520 

GACATTGACA TCAACCCGGG GGATGAGCCA GTCCCCATTC CTGCCATTGA TCATTATATA 2580 

CATAGACCTA AACGAGCTGT ACAGTTCATC CCTTTACTAG CTGGACTGGG AATCACCGCA 2640 

GCATTCACCA CCGGAGCTAC AGGCCTAGGT GTCTCCGTCA CCCAGTATAC AAAATTATCC 2700 

CATCAGTTAA TATCTGATGT CCAAGTCTTA TCCGGTACCA TACAAGATTT ACAAGACCAG 2760 

GTAGACTCGT TAGCTGAAGT AGTTCTCCAA AATAGGAGGG GACTGGACCT ACTAACGGCA 2820 

GAACAAGGAG GAATTTGTTT AGCCTTACAA GAAAAATGCT GTTTTTATGC TAACAAGTCA 2880 

GGAATTGTGA GAAACAAAAT AAGAACCCTA CAAGAAGAAT TACAAAAACG CAGGGAAAGC 2940 

CTGGCAACCA ACCCTCTCTG GACCGGGCTG CAGGGCTTTC TTCCGTACCT CCTACCTCTC 3000 

CTGGGACCCC TACTCACCCT CCTACTCATA CTAACCATTG GGCCATGCGT TTTCAGTCGC 3060 

CTCATGGCCT TCATTAATGA TAGACTTAAT GTTGTACATG CCATGGTGCT GGCCCAGCAA 3120 

TACCAAGCAC TCAAAGCTGA GGAAGAAGCT CAGGATTGAG GCGCCTAGTG TTGACAATTA 3180 

ATCATCGGCA TAGTATACGG CATAGTATAA TACGACTCAC TATAGGAGGG CCACCATGGC 3240 

CAAGTTGACC AGTGCCGTTC CGGTGCTCAC CGCGCGCGAC GTCGCCGGAG CGGTCGAGTT 3300 

CTGGACCGAC CGGCTCGGGT TCTCCCGGGA CTTCGTGGAG GACGACTTCG CCGGTGTGGT 33 60 

CCGGGACGAC GTGACCCTGT TCATCAGCGC GGTCCAGGAC CAGGTGGTGC CGGACAACAC 3420 

CCTGGCCTGG GTGTGGGTGC GCGGCCTGGA CGAGCTGTAC GCCGAGTGGT CGGAGGTCGT 3480 

GTCCACGAAC TTCCGGGACG CCTCCGGGCC GGCCATGACC GAGATCGGCG AGCAGCCGTG 3540 

GGGGCGGGAG TTCGCCCTGC GCGACCCGGC CGGCAACTGC GTGCACTTCG TGGCCGAGGA 3600' 

GCAGGACTGA NNNNCGGACC GGTCGACTTG TTAACTTGTT TATTGCAGCT TATAATGGTT 3660 

ACAAATAAAG CAATAGCATC ACAAATTTCA CAAATAAAGC ATTTTTTTCA CTGCATTCTA 3720 

GTTGTGGTTT GTCCAAACTC ATCAATGTAT CTTATCATGT CTGGATCCAG ATCTGGGCCC 3780 

ATGCGGCCGC GGATCGATNN NNACATGTGA GCAAAAGGCC AGCAAAAGGC CAGGAACCGT 3 840 

AAAAAGGCCG CGTTGCTGGC GTTTTTCCAT AGGCTCCGCC CCCCTGACGA GCATCACAAA 3900 

AATCGACGCT CAAGTCAGAG GTGGCGAAAC CCGACAGGAC TATAAAGATA CCAGGCGTTT 3960 

CCCCCTGGAA GCTCCCTCGT GCGCTCTCCT GTTCCGACCC TGCCGCTTAC CGGATACCTG 4020 

TCCGCCTTTC TCCCTTCGGG AAGCGTGGCG CTTTCTCAAT GCTCACGCTG TAGGTATCTC 4080 
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AGTTCGGTGT AGGTCGTTCG CTCCAAGCTG GGCTGTGTGC ACGAACCCCC CGTTCAGCCC 4140 
GACCGCTGCG CCTTATCCGG TAACTATCGT CTTGAGTCCA ACCCGGTAAG ACACGACTTA " 4200 

TCGCCACTGG CAGCAGCCAC TGGTAACAGG ATTAGCAGAG CGAGGTATGT AGGCGGTGCT 42*0 

ACAGAGTTCT TGAAGTGGTG GCCTAACTAC GGCTACACTA GAAGGACAGT ATTTGGTATC 43?0 

TGCGCTCTGC TGAAGCCAGT TACCTTCGGA AAAAGAGTTG GTAGCTCTTG ATCCGGCAAA 4380 

CAAACCACCG CTGGTAGCGG TGGTTTTTTT GTTTGCAAGC AGCAGATTAC GCGCAGAAAA 4440 

AAAGGATCTC AAGAAGATCC TTTGATCTTT TCTACGGGGT CTGACGCTCA GTGGAACGAA 4500 

AACTCACGTT AAGGGATTTT GGTCATGAGA TTATCAAAAA GGATCTTCAC CTAGATCCTT 4560 

TTAAATTAAA AATGAAGTTT TAAATCAATC TAAAGTATAT ATGAGTAAAC TTGGTCTGAC 462 0 

AGTTACCAAT GCTTAATCAG TGAGGCACCT ATCTCAGCGA TCTGTCTATT TCGTTCATCC 4680 

ATAGTTGCCT GACTCCCCGT CGTGTAGATA ACTACGATAC GGGAGGGCTT ACCATCTGGC 4740 

CCCAGTGCTG CAATGATACC GCGAGACCCA CGCTCAGCGG CTCCAGATTT ATCAGCAATA 4800 

AACCAGCCAG CCGGAAGGGC CGAGCGCAGA AGTGGTCCTG CAACTTTATC CGCCTCCATC 4860 

CAGTCTATTA ATTGTTGCCG GGAAGCTAGA GTAAGTAGTT CGCCAGTTAA TAGTTTGCGC 4920 

AACGTTGTTG CCATTGCTAC AGGCATCGTG GTGTCACGCT CGTCGTTTGG TATGGCTTCA 4980 

TTCAGCTCCG GTTCCCAACG ATCAAGGCGA GTTACATGAT CCCCCATGTT GTGCAAAAAA 5040 

GCGGTTAGCT CCTTCGGTCC TCCGATCGTT GTCAGAAGTA AGTTGGCCGC AGTGTTATCA 5100 

CTCATGGTTA TGGCAGCACT GCATAATTCT CTTACTGTCA TGCCATCCGT AAGATGCTTT 5160 

TCTGTGACTG GTGAGTACTC AACCAAGTCA TTCTGAGAAT AGTGTATGCG GCGACCGAGT 5220 

TGCTCTTGCC CGGCGTCAAT ACGGGATAAT ACCGCGCCAC ATAGCAGAAC TTTAAAAGTG 5280 

CTCATCATTG GAAAACGTTC TTCGGGGCGA AAACTCTCAA GGATCTTACC GCTGTTGAGA 5340 

TCCAGTTCGA TGTAACCCAC TCGTGCACCC AACTGATCTT CAGCATCTTT TACTTTCACC 5400 

AGCGTTTCTG GGTGAGCAAA AACAGGAAGG CAAAATGCCG CAAAAAAGGG AATAAGGGCG 5460 

ACACGGAAAT GTTGAATACT CATACTCTTC CTTTTTCAAT ATTATTGAAG CATTTATCAG 5520 

GGTTATTGTC TCATGAGCGG ATACATATTT GAATGTATTT AGAAAAATAA ACAAATAGGG 5580 

GTTCCGCGCA CATTTCCCCG AAAAGTGCCA CCTGACGTCT AAGAAACCAT TATTATCATG 5640 

ACATTAACCT ATAAAAATAG GCGTATCACG AGGCCCTTTC GTCTCGCGCG TTTCGGTGAT 5700 

GACGGTGAAA ACCTCTGACA CATGCAGCTC CCGGAGACGG TCACAGCTTG TCTGTAAGCG 5760 

GATGCCGGGA GCAGACAAGC CCGTCAGGGC GCGTCAGCGG GTGTTGGCGG GTGTCGGGGC 5820 

TGGCTTAACT ATGCGGCATC AGAGCAGATT GTACTGAGAG TGCAC ~~ 5 865 
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AGATCTCCCG 
AGCCAGTATC 
TAAGCTACAA 
CGTTTTGCGC 
AGTTATTAAT 
GTTACATAAC 
ACGTCAATAA 
TGGGTGGACT 
AGTACGCCCC 
ATGACCTTAT 
ATGGTGATGC 
TTTCCAAGTC 
GACTTTCCAA 
CGGTGGGAGG 
GCTTATCGAA 
CTTTTTCGCT 
TAGAATGGGA 
CTTTCTACTC 
TTCGTTAAAC 
AGATTGTAAG 
TACTTCAGCA 
GCATATAAAT 
CATCATCCTG 
ATACTCTGAG 
ACAGCTCCTG 
GGAACAGCAT 
TTAACCCGTG 
GCCCCCATCA 
CCAATGCCAC 
TATGTGATCT 
GCTGCAAATA 
GGCATACCGT 
GTGAAACCAC 
AGCGCGGTAA 
ACCTCTCCAA 
TAGTCCTAGA 
GACTGAGACT 
TCCTCAATAT 
CCCCCTCCCG 
CAGCCTCTAT 
TGCTAAACCT 
AAGAATGTTG 
GCACTTATAC 
TTACCCTATC 
AGGCCTTATG 
CTGGAACAAT 
ATCTAACCAC 
CCGATTATAT 
TGACCCTGGC 
CGGGGACCAC 
CAGACCTCAA 
CTGAAGTAGT 
TCTGCGCAGC 
ACAGCATGGC 
AAGGTTGGTT 
TCATGGGACC 
GATTAGTTCA 
AATACCACCA 
TCATCGGCAT 
AAGTTGACCA 
TGGACCGACC 
CGGGACGACG 
CTGGCCTGGG 
TCCACGAACT 
GGGCGGGAGT 
CAGGACTGAN 



ATCCCCTATG 
TGCTCCCTGC 
CAAGGCAAGG 
TGCTTCGCGA 
AGTAATCAAT 
TTACGGTAAA 
TGACGTATGT 
ATTTACGGTA 
CTATTGACGT 
GGGACTTTCC 
GGTTTTGGCA 
TCCACCCCAT 
AATGTCGTAA 
TCTATATAAG 
ATGTCGACTG 
ATTGTAAAAT 
AGATGTCCCT 
TGTTGACAAC 
TTTAGCTTGC 
TACTTTCTCT 
CAGTTTTAGA 
TCTGGCTGGC 
CCTTTCTCTT 
TCCAAACCGG 
GGCAACGTGC 
CAGGACCGAC 
GAAGTCCTTA 
GGTCTTTAAT 
CTCCCTTTTA 
GGTCGGAGAA 
CCCCGGAGGG 
AAAATCGGGG 
CGGACAGGCT 
CACCCCCTGG 
AGTATCCAAT 
ATTCACTGAT 
GTACCGGACA 
AGGGCCCCGC 
ACCCGTGCAG 
AGTCCCTGAG 
GGTAGAAGGA 
GCTGTGCTTA 
CAATCATTCT 
TGAAGTGACA 
TAACACCACC 
GTGGGCTTGT 
AGACTATTGT 
GTATGGTCAG 
CCTTCTGCTA 
TGCCCTAATC 
CGAAGTCGAA 
CCTACAGAAC 
CCTAAAAGAA 
CAAACTAAGG 
CGAAGGGCAG 
TCTAATAGTA 
ATTTGTTAAA 
GCTAAAGCCT 
AGTATACGGC 
GTGCCGTTCC 
GGCTCGGGTT 
TGACCCTGTT 
TGTGGGTGCG 
TCCGGGACGC 
TCGCCCTGCG 
NNNCGGACCG 



GTCGACTCTC 
TTGTGTGTTG 
CTTGACCGAC 
TGTACGGGCC 
TACGGGGTCA 
TGGCCCGCCT 
TCCCATAGTA 
AACTGCCCAC 
CAATGACGGT 
TACTTGGCAG 
GTACATCAAT 
TGACGTCAAT 
CAACTCCGCC 
CAGAGCTCTC 
AGAACTTCAG 
TCATGTTATA 
TGTATCACCA 
CATTGTCTCC 
ATTTGTAACG 
AATCACTTTT 
GAACAATTGT 
GTGGAAATAT 
TATGGTTACA 
GCCCCTCTGC 
TGGTTGTTGT 
ATGGAAGGTC 
ATGGTCATGG 
GTAACCTGGA 
GGAACTGTAC 
GAGTGGGACC 
AGAAAGCGGA 
TGTGGGGGGC 
TACTGGAAGC 
GACACGGGAT 
TCCTTCCAAG 
GCAGGAAAAA 
GGAACAGATC 
ATCCCCATTG 
ATCAGGCTCC 
ACTGCCCCAC 
GCCTATCAGG 
GTGTCGGGAC 
ACCGCCCCGG 
GGACAGGGCC 
CAAAGTGCCG 
AGCACTGGAT 
GTATTAGTTG 
CTTGAACAGC 
GGAGGATTAA 
AAAACCCAGC 
AAATCAATTA 
CGAAGAGGCC 
GAATGTTGTT 
GAAAGGCTTA 
TTTAATAGAT 
CTCTTACTGA 
GACAGGATCT 
ATAGAGTACG 
ATAGTATAAT 
GGTGCTCACC 
CTCCCGGGAC 
CATCAGCGCG 
CGGCCTGGAC 
CTCCGGGCCG 
CGACCCGGCC 
GTCGA 



AGTACAATCT 
GAGGTCGCTG 
AATTGCATGA 
AGATATACGC 
TTAGTTCATA 
GGCTGACCGC 
ACGCCAATAG 
TTGGCAGTAC 
AAATGGCCCG 
TACATCTACG 
GGGCGTGGAT 
GGGAGTTTGT 
CCATTGACGC 
TGGCTAACTA 
GGTGAGTTTG 
TGGAGGGGGC 
TGGACCCTCA 
TCTTATTTTC 
AATTTTTAAA 
TTTTCAAGGC 
TATAATTAAA 
TCTTATTGGT 
ATGATATACA 
TAACCATGTT 
GCTGTCTCAT 
CAGCGTTCTC 
GGGTCTATTT 
GAGTCACCAA 
AAGATGCCTT 
CTTCAGACCA 
CCCGGACTTT 
CAAGAGAGGG 
CCACATCATC 
GCTCCAAAAT 
GGGCTACTCG 
AGGCTAATTG 
CTATTACCAT 
GGCCTAATCC 
CCAGGCCTCC 
CTTCTCAACA 
CGCTTAACCT 
CTCCTTATTA 
CCAGCTGTAC 
TATGCATGGG 
GCTCAGGATC 
TGACTCCCTG 
AGCTCTGGCC 
GTACCAAATA 
CCATGGGAGG 
AGTTTGAGCA 
CCAACCTAGA 
TAGATTTGCT 
TTTATGCAGA 
ATCAGAGACA 
CCCCCTGGTT 
TCTTACTCTT 
CAGTAGTCCA 
AGCCATAGGG 
ACGACTCACT 
GCGCGCGACG 
TTCGTGGAGG 
GTCCAGGACC 
GAGCTGTACG 
GCCATGACCG 
GGCAACTGCG 



GCTCTGATGC 
AGTAGTGCGC 
AGAATCTGCT 
GTTGACATTG 
GCCCATATAT 
CCAACGACCC 
GGACTTTCCA 
ATCAAGTGTA 
CCTGGCATTA 
TATTAGTCAT 
AGCGGTTTGA 
TTTGGCACCA 
AAATGGGCGG 
GAGAACCCAC 
GGGACCCTTG 
AAAGTTTTCA 
TGATAATTTT 
TTTTCATTTT 
TTCACTTTTG 
AATCAGGGTA 
TGATAAGGTA 
AGAAACAACT 
CTGTTTGAGA 
CATGCCTTCT 
CATTTTGGCA 
AAAACCCCTT 
AAGAGTAGGG 
CCTGATGACT 
CCCAAGATTA 
GGAACCATAT 
TGACTTTTAC 
CTACTGTGGT 
ATGGGACCTA 
GGCTTGTGGC 
AGGGGGCAGA 
GGACGGGCCC 
GTTCTCCCTG 
CGTGATCACT 
TCAGCCTCCT 
ACCTGGGACG 
CACCAATCCC 
CGAAGGAGTA 
GGCCACTTCC 
AGCACTACCT 
CTACTACCTT 
CTTGTCCACC 
CAGAATAATT 
TAAGAGGGAG 
GATTGCAGCT 
GCTTCACGCC 
AAAGTCACTG 
CTTCCTAAAA 
CCACACGGGA 
AAAACTATTT 
TACCACCTTA 
TGGACCCTGC 
GGCTTTAGTC 
CGCCTAGTGT 
ATAGGAGGGC 
TCGCCGGAGC 
ACGACTTCGC 
AGGTGGTGCC 
CCGAGTGGTC 
AGATCGGCGA 
TGCACTTCGT 



CGCATAGTTA 
GAGCAAAATT 
TAGGGTTAGG 
ATTATTGACT 
GGAGTTCCGC 
CCGCCCATTG 
TTGACGTCAA 
TCATATGCCA 
TGCCCAGTAC 
CGCTATTACC 
CTCACGGGGA 
AAATCAACGG 
TAGGCGTGTA 
TGCTTAACTG 
ATTGTTCTTT 
GGGTGTTGTT 
GTTTCTTTCA 
CTGTAACTTT 
TTTATTTGTC 
TATTATATTG 
GAATATTTCT 
ACATCCTGGT 
TGAGGATAAA 
TCTTTTTCCT 
AGGATCGGCC 
AAAGATAAGA 
ATGGCAGAGA 
GGGCGTACCG 
TATTTTGATC 
GTCGGGTATG 
GTGTGCCCTG 
GAATGGGGTT 
ATCTCCCTTA 
CCCTGCTACG 
TGCAACCCTC 
AAATCGTGGG 
ACCCGCCAGG 
GGTCAACTAC 
CCTACAGGCG 
GGAGACAGGC 
GACAAGACCC 
GCGGTCGTGG 
CAACATAAGC 
AAAACTCACC 
GCAGCACCCG 
ACGATGCTCA 
TACCACTCCC 
CCAGTATCGT 
GGAATAGGGA 
GCTATCCAGA 
ACCTCGTTGT 
GAGGGAGGTC 
CTAGTGAGAG 
GAGTCAGGCC 
ATCTCCACCA 
ATTCTCAATC 
CTGACTCAAC 
TGACAATTAA 
CACCATGGCC 
GGTCGAGTTC 
CGGTGTGGTC 
GGACAACACC 
GGAGGTCGTG 
GCAGCCGTGG 
GGCCGAGGAG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3925 
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